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1. Status of the problem. Summary 


After the classical researches of CHRISTOFFEL, HuGontoT, HADAMARD, and 
_DUHEM on waves in elastic materials, it might seem that little remains to be 
learned. Such is not the case. As for most parts of mechanics, it has been 
necessary in the last decade to go over the matter again, not only so as to free 
the conceptual structure from lingering linearizing and to fix it more solidly 
in the common foundation of modern mechanics, but also so as to derive from 
it specific predictions satisfying modern needs for contact between theory and 
‘rationally conceived experiment. After reading the recent papers by Toupin 
_& BERNSTEIN [1961,1] and by Haves & RIvLINn [1961, 2], I have seen that 
more can be learned than is there proved. In the present paper I follow Toupin 
& BERNSTEIN’S approach to the general theory yet try to achieve the elegant 
and explicit directness of ERICKSEN’S earlier treatment of isotropic incompressible 
materials [1953]. At the same time, all the results of Haves & RIVLIN are 
obtained in shorter but more general form as immediate corollaries. 
Arch. Rational Mech. Anal., Vol. 8 19 
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It should not be thought, however, that this paper serves only as a condensing 
summary. Rather, I consider the general theory of elasticity instead of its 
special case, which was the starting point of all previous authors and whig 
will here be called hyperelastic, when the stresses are derivable from a stored 
energy. The more general elastic theory, though as yet but dimly known, is 
of some interest in itself: In statics, any simple material according to the defi- 
nition of Nori [1958] is an elastic material, though not necessarily a hyper 
elastic one, and also R1viin [1959, 2] [1960] has shown that for certain classes: 
of deformations a fairly general kind of material is approximately elastic, though. 
again not necessarily hyperelastic. These observations, while not particularly; 
relevant to the problem of wave propagation, give the more general theory of. 
elasticity some individual status, although there are various energy theorems: 
standing in favor of the hyperelastic special case. 

In a given elastic material subject to a given state of strain, a wave may 
propagate in a given direction only if its amplitude is parallel to an acoustic: 
axis; corresponding to each axis is a uniquely determined absolute speed of: 
propagation. In a hyperelastic material there always exists an orthogonal real. 
triad of acoustic axes, though the corresponding speeds are not necessarily real.. 
For a general elastic material, the possibilities are more various and less definite; 
in some cases there may be but one real acoustic axis, and if there are three,: 
they need not be orthogonal. However, it turns out that the assumption of 
isotropy alone, regardless of whether or not a stored energy exists, suffices to 
derive some definite acoustical theorems in a generality greater than any here- 
tofore known even for the hyperelastic case. 

These theorems may be foreshadowed by physical reasoning. Waves travelling 
in the direction of a unit vector m in a general elastic material face a complex: 
of three triads of generally distinct special directions: (1) the vectors d, in terms 
of which the symmetries of the material may be specified, (2) the principal axes: 
of stress, (3) the principal axes of strain in the deformed material. From these: 
three sets of directions, as well as from the local rotation, the acoustic axes fo 
the direction n are somehow determined. In an isotropic material, however, 
the vectors d, are arbitrary, the principal axes of stress and strain coincide, 
and the local rotation is irrelevant. Thus the acoustic axes depend upon one 
orthogonal set of directions in the material and, of course, upon n. When 7 
is parallel to a principal axis of strain and stress, there remain but three direc- 
tions, namely, the principal axes themselves, to influence the acoustic response.4 
Thus we may expect, and indeed I shall prove, that in an isotropic elastic material 
the acoustical axes for a wave travelling down a principal axis of stress and straim 
coincide with the principal axes. In particular, a wave travelling down a princip 
axis of stress and strain in an isotropic body is always either longitudinal o 
transverse. Finally, for these waves I derive explicit and exact formulae for the 
speeds of propagation. Thus for isotropic elastic materials the presence or absence 
of a stored energy is of secondary importance in problems of wave propagation... 

These same results may be motivated also by starting from the infinitesima 
theory. In that theory, the material is virtually unstressed and unstrained as: 
far as the passage of waves is concerned. The only preferred directions are thos 
of the material itself, and these determine the acoustic axes for each directio 
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)of propagation. In the isotropic case, there are no preferred material directions, 
/and the only direction which can influence the acoustic axes is that of the wave 
mormal. Hence, as is well known, every wave is necessarily either normal or 
transverse; any transverse amplitude is possible; and all waves of any one of 
‘these two kinds have a common absolute speed of propagation. In a finitely 
‘strained body, in general, this simple result holds no longer!. Rather, on a 


wave travelling in a severely strained isotropic body the existence of principal 
axes of stress and strain must have much the same effect as does that of the 


-axes of symmetry of an unstrained anisotropic material. Only when the wave 
‘itself travels down one of these axes is the total number of special directions 


reduced to three, and there is then no reason for the acoustic axes to be other 
than these. Thus, again, we are led to expect the results announced above. 

But for one special case, namely, that of a material subject to hydrostatic 
pressure, simplicity comparable to that of the linearized theory remains in force 
for strains of any magnitude. In this case, I show that every wave is necessarily 
either transverse or longitudinal; that all waves of each kind have the same 
absolute speed; and that a simple identity connects the bulk modulus, the 
absolute speed of the longitudinal waves, and the absolute speed of the trans- 
verse waves. It follows that longitudinal waves always travel faster than trans- 
verse waves, and that if transverse waves are possible, the absolute speed of 
longitudinal waves is always greater than the speed of sound according to the 
hydrodynamical theory for the same law of compression. 

Returning to arbitrary states of strain, I derive a number of universal 
formulae, valid for all possible forms of the stress-strain relations for an isotropic 


/ material, which connect the speeds of the 9 different kinds of waves that can 
travel along the principal axes of stress and strain. In particular, the speeds 


of any 2 of the 6 possible kinds of transverse waves determine explicitly those 
of the remaining 4. These formulae serve as conditions of compatibility which 
must be satisfied by measured values of the speeds if they are to be consistent 
with the theory of elasticity for isotropic materials. Supposing these conditions 
to be satisfied, I find simple explicit equations determining the form of the 
stress-strain relations directly from the speeds of the longitudinal waves along 
the three principal axes and from the speeds of the two kinds of transverse waves 
along any one of them. 

From these results follows a differential criterion whereby it can be determined 
from the wave speeds as functions of the principal stretches whether or not 
an isotropic elastic material be hyperelastic. 

Conditions under which the wave speeds are real are then investigated. 
Among other things, it is shown that the inequalities of BAKER & ERICKSEN 


are necessary and sufficient that all six kinds of transverse waves in isotropic 


] 


| materials have real speeds of propagation. 


By expanding the exact formulae for the wave speeds in powers of the three 
principal extensions, I calculate the wave speeds according to the second-order 


theory of elasticity. The results of L. Brirtouin for hydrostatic pressure and 


1 HaADAMARD [1903, pp. 274—277] determined the very special form the stored 


energy must have in order that every wave, whatever its direction and whatever 


the strain, be necessarily either longitudinal or transverse. 
19* 
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of Toupin & BERNSTEIN for bi-axial strain are included as special cases. Certain 


algebraic conditions must be satisfied by the 9 wave speeds in order that they 


be compatible with the second-order theory; the 4 second-order elasticities are | 
then determined explicitly by any 2 transverse-wave speeds and any 2 longi- | 


tudinal-wave speeds. I show also that a simple relation between one longitudinal- 
wave speed and one transverse-wave speed is a necessary and sufficient condition 
that there be a stored energy in the second-order theory. So far as I know, this 
result affords the first chance for a direct and clear experimental test of the 
existence of a stored energy. 

Finally I present new and simple proofs of three theorems of DUHEM con- 
necting the wave speeds in elastic materials conducting heat according to 
FouriEr’s law with those in elastic materials that do not conduct heat at all. 


2. General theory of acceleration waves+ 


At a singular surface of second order with unit normal m the following 
geometrical and kinematical conditions of compatibility [CFT, Eqs. (190.1), 


[ 


(190.2), (190.5)] must be satisfied by the jumps [x*,,] and [%*] in the second , 


derivatives x*,, of the deformation «*=x«*(X%,t) and by the jump of the 
acceleration x": 
[ap] = ah 0 1 gM Mp, [HJ = UPR a", (2.1) 


where @ is a vector which may be called the amplitude and where U is the © 


speed of propagation. The corresponding dynamical condition [CFT, Eq. (205.5) ] is © 


Le | Nin = 0 ? (2.2) 
where # is CAUCHY’S stress tensor. 


To begin with, we take the constitutive equation of elasticity in fully general 
curvilinear co-ordinate systems at X and a in the form? 


T= hy" (x™ g, ha), (2.3) 


where T is the first PloLA-K1RCHHOFF stress tensor’, where is an arbitrary 
function which is assumed to be as smooth as desired, and where the three 
vectors d, constitute the natural basis of the material co-ordinate system for 
the particle in question. The dynamical condition (2.2) is satisfied trivially by 


* While this paper is self-contained, the reader is expected to know the principles 
of modern continuum mechanics; for ease of reference in case of doubt, by using 
the prefix CFT we cite the relevant equations or sections in TRUESDELL & TOUPIN’S 
The Classical Field Theories, with an appendix on Invariants by J. L. EricKseEn, in 
FLUGGE’s Handbuch der Physik, vol. 3/1, p. 225—793, 1960. 

* From these formulae it is clear that the method of Frnzr [1 942], which is akin 
to that we use in §7 for isotropic materials, is not general. In fact, Finzi expressly 
limited himself to infinitesimal deformations, although he allowed the stress-strain 
relations to be of arbitrary form. 

* Cf. CFT, §210. For the purposes of the present paper we may take T as defined 
formally by the relation 


ee pie oe (A) 


where g and @ are the mass densities in the present configuration and in the reference | 


configuration, respectively, 


; 
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¢ (2.3), since x”, and hence also €, is continuous at the singular surface. In terms 
» of T, Caucuy’s first law of motion assumes the form [CFT, Eq. (210.8)] 


} 


Lek Opi O Pps (2.4) 


' where f is the extrinsic force per unit mass, where @ is the mass density in the 
_ reference configuration, and where the semicolon denotes the total covariant 
| derivative [CFT, App., §20]. 


Let us set / ab,2 
Arges Cee di) = aa” . (2.5) 


Substitution of (2.3) into (2.4) then yields equations of motion in the form 
Ong 


(aB) .m 
Ag, x 0B TS am 


Qa ON O Aer (2.6) 
Assuming that 4,,(°), o§%/dda", a!.,, 0, and f, are continuous across the singular 
surface*, we calculate the jump of (2.6), use (2.1), and so obtain the following 
propagation condition 


Qi m(n) a” = 0 U*a,, (2.7) 
where 
Qe m (2) = A,,°?) XP oy XB My Ng (2.8) 
4 Oe ane Pi 
(=. ane =| Hee ae Me 
Zale WE Pahl igh a | 


The tensor O(n) may be called the acoustical tensor® for the direction n in the 
elastic material with response function , subject to the deformation gradient x” ,. 
From (2.7) we read off the first main result, which may justly be called the 
Fresnel-Hadamard theorem in fully general form: The amplitude of a wave 
travelling in the direction of n must be a right proper vector of the acoustical tensor 
QO (n); the speed of propagation U must be such that @ U* is the corresponding proper 
number, conversely, any real right proper vector of O(n) ts a possible amplitude, 
provided the corresponding proper number is real and not negative. 


The speeds may be determined from the roots of the real cubic equation 
det (Orn oF 0 2 On) =0 (2.9) 


and hence may be exhibited explicitly as algebraic functions of the principal 
invariants of O(n); however, if it is known that @ is a possible amplitude, it is 
easier to determine the corresponding squared speed from the formula 


0 U? = Qt m) a’ a” a, (214 0) 


which follows at once from (2.7). 


4Tn any uniformly aeolotropic material d/.,,is in fact identically zero. However, 
at the interface of two dissimilar homogeneous materials, the above-stated condi- 
tions are not satisfied, for h itself fails to be continuous. 

5 In the hyperelastic theory, where, as we shall see in §12, Q(n) is symmetric 
for every n, the quadric of Q(n) is called Fresnel’s ellipsoid or the ellipsoid of polari- 
sation. Since Q(n) in the generality maintained here is not necessarily symmetric, 
it is not determined by its quadric, so no particular name is given to this latter. 
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We note that Q@(—n)=Q(n); that a is a possible amplitude if and only if 
ka is also a possible amplitude for every real k; and that U is a possible speed — 
of propagation for a wave travelling in the direction of n with amplitude a if | 
and only if —U is. Beyond the obvious symmetries corresponding to these : 
properties of Q(n), none other is to be expected in the general theory. 


3. General theory of waves of higher order 


For a singular surface of third order we have, among others, the following 
geometrical and kinematical conditions of compatibility [CFT, §191]: 


k ugh tht p qd 
[x by] =O 0 a Hp ¥y Min Mp Mg » 


[%",] =U% a" a, 


(3-1) 
Differentiation of (2.6) with respect to x? yields 


; 0A, (%B) aa 
Ae Mg By ep s ( m 


é 7, M8y Apap t pay Tie), Ket Oh,p =O h.0: (3.2) 


If we assume that f,,, a7, 4%;,, and @%,,,, are continuous across the singular 
surface, by taking the jump of (3.2) and employing (3.1) we conclude that 


Agm'? XY pa x4 4X gH Ng, N; = @ U? ay Mp» G.3) 


a formula which reduces at once to (2.7). Similar analysis may be applied to 
any singular surface of higher order. If we are willing to presume that the 
vectors d,, the field of body force f, and the functions f,* in the constitutive 
equation (2.3) are arbitrarily many times continuously differentiable, we may there- 
fore assert the following first theorem of equivalence?: a and U are a possible 
amplitude and speed of propagation for a wave of order p travelling in the direction 
n if and only if they are likewise possible for a wave order q, where p and q are 
any two integers greater than 1. Thus by considering waves of second order we 
exhaust the whole theory of singular surfaces other than shock waves and first 
or zero-order vortex sheets. 


4. General theory of plane infinitesimal progressive waves in a material subject 
to homogeneous strain 


A displacement of the form 
u=asin(n-P— U2), (4.1) 


where @ and m are constant vectors and P is the position vector, is said to 
represent a sinusoidal plane progressive wave of amplitude a, propagating in the 
direction m at speed U. In general, as is clear from (2.6), a displacement of 
this kind is not possible in an elastic material. There is, however, an important 
special case in which (2.6) may be replaced, approximately, by a linear system 
with constant coefficients, namely, when w is an infinitesimal displacement 
superimposed upon a given state of homogeneous strain. While many authors 


a While I cannot cite a previous statement of this theorem, it is only to be expected 
in view of known theorems for fluids and of EricKsENn’s corresponding theorem for 
isotropic incompressible hyperelastic materials [1953, 1, §6]. 
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during the past seventy years have derived various forms of the appropriate 
differential equations of motion, the simplest and most general treatment is 
) that of Toupin & BERNSTEIN [1961, 1, §2], which we shall here employ. Letting 
the co-ordinates X“ refer to the homogeneously strained initial state, in which 
the density is @), we may write the differential equations for the additional 
infinitesimal displacement w in the form 


Kade u? pp = 09 tim, (4.2) 


| 
‘where we have assumed that a, and that no extrinsic force is acting?, 


- and where 
Kuen = ai go ee xr aes Whigs ’ (4.3) 


&p being the shifter? effecting finite parallel displacement from X to #, and the 
argument of A,,“° being the gradient of the deformation from the natural 
state, with co-ordinates X%, to the initial state, with. co-ordinates X*. So as 
_ to derive (4.2) it is assumed that the deformation from X to X is homogeneous; 
_ there is no reason to expect, in general, that (4.2) will hold otherwise?. 


Substitution of (4.1) into (4.2) leads to the condition 


Kuo Np NR a 00 U2 an: (4.4) 


' Since the superimposed periodic displacement uw is assumed to correspond to 


_ an infinitesimal deformation, Xie ww ght ak x, and hence by (4.3) and (2.8) we see 
that - Kao?) np Na = gi eh Ame) Np dae ak in 
| = BME Ama” x? 2" Bp MP er MR, (4.5) 
| 

= §M8Q Omg: 


In virtue of this result, (4.4) has just the same form as does (2.7) shifted from 
a to XM. Since such a displacement is negligible in the present case, we obtain 
the second theorem of equivalence?*: at a point in an elastic material where the 
deformation gradient is x* ,, the amplitude a and the speed U satisfy the propagation 
condition for a singular surface of order 2 or more with umt normal n if and 
only if a plane infinitesimal sinusoidal wave of amplitude a may propagate in 
the direction of n at speed U through a specimen of the material subject to a homo- 
geneous strain in which the deformation gradient is x* , 


1 Since the differential equations are now linear, extrinsic force may be taken 
into account by the addition of a particular integral. If the extrinsic force is inde- 
pendent of time, so is the corresponding integral; therefore time-independent forces 
have no effect on the propagation of infinitesimal waves. 

2 For explanation, see CFT, App., §16. The reader content with a single Cartesian 


system may set ge =Ok. The devotees of convected co-ordinates must resort to more 
elaborate expedients. 
3 For arbitrary gradients X« , the differential equations for uw are of the form 


(Km v2 g), p=20tim (B) 


as shown by TouPin & BERNSTEIN [1961, 1, Eq. (2.19)]. 
4 This theorem generalizes a number of classical results in more special theories; 
cf. CFT, §194A. 
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5. Contrast and comparison of different types of waves 


In virtue of the two theorems of equivalence, we are justified in interpreting | 
any result in the theory of wave propagation either in terms of singular surfaces 
or in terms of plane infinitesimal oscillations, as we please. The mathematical 
status of these interpretations are not equal, however. The theory of infinitesimai 
oscillations results from ‘“‘approximations”’ of as yet unestablished validity; also, . 
it refers only to an underlying homogeneous strain and applies, at least as so 
car developed, only to plane waves. The speeds of propagation U are always | 
aonstant. The theory of singular surfaces is exact; it applies equally to all. 
omplitudes, all underlying states of strain, and all shapes of waves!. The speeds | 
af propagation depend upon the existing strain and thus may vary with place 
and time. In this sense, the interpretation of results of the theory in terms of 
fcceleration waves is both general and free of objection. 

On the other hand, the relevance of the theory of singular surfaces is 
uncertain. That theory refers, strictly, only to necessary conditions for 
solutions. If there is a solution in which a singular surface with unit normal 
n carries a jump of amplitude a, then (2.7) must be satisfied at each point, . 
but a solution to (2.7) for a and n varying smoothly over a set of points does 
not generally yield a singular surface. First, it is necessary that at each time 
the unit normals n correspond to surface elements that sweep out a surface. 
Second, as is clear from known examples in the classical linearized theory, the 
dynamical equations (2.6) impose conditions on the motion of the singular surface | 
and on the variation of a in time. The only deformation known to be possible 
for all uniformly aeolotropic materials is a state of homogeneous strain. Since 
x* .g=0 inany such state, it follows that [x*,,] =0, so that no non-trivial singular 
surface of second or higher order can travel through such a state. Thus, at 
present, no solution corresponding to any propagating singular surface is known. 

These facts raise the disquieting thought that the whole theory of singular 
surfaces may turn out to be an empty formalism. I doubt, however, that this 
danger is real. It is clear that solutions consisting of two different homogeneous 
strains connected by a plane shock wave exist. As the initial strength of this 
shock wave is taken smaller and smaller, we expect that the relation between 
its amplitude and its speed will tend to satisfy the propagation condition for 
waves of second order. Analysis toward this end has been given by JOUGUET 
(1920, 1—4] [1921], and doubtless a general theorem of equivalence corresponding 
to more modern views on the thermodynamics of deformation could be established. 
In this way should result a universal status for plane waves of second order, and 
possibly also for waves of certain curved forms, as limiting cases of weak shocks. 
A general theory of shock waves in elastic materials is reserved for later study. 

In this paper I present only a fairly complete analysis of the propagation 
condition (2.7), leaving for the future the search for corresponding solutions. 
So as to avoid lengthy wording, the necessary and sufficient criteria given in 
the following theorems refer only to the propagation condition, with no impli- 
cation that singular surfaces corresponding to the results actually exist. 


; Another difference may be seen in the effects of extrinsic forces: A continuous 
extrinsic force has no effect on the propagation of singular surfaces, while a time- 
independent extrinsic force has no effect on infinitesimal oscillations. 
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The interpretation in terms of infinitesimal oscillations has one advantage 
in that for it the non-positive proper numbers of Q(n) may be interpreted. 
For if U?<0, then U=+i«, where « > 0; from a linear combination of complex 
solutions of the type (4.1) we may construct a real solution of the form 


u=ae*'sin(n:P), (5:74) 
and the magnitude of such a solution increases beyond all bounds as {—> co. Thus, 


if the linearized theory may be trusted, there are certain disturbances which, 
however small they may be initially, in time grow large enough to destroy the 


_ presumed underlying homogeneous strain. Thus that homogeneous strain is 


unstable in the sense of infinitesimal oscillations?. Complex roots U? occur in 


_ conjugate pairs and lead to conjugate proper vectors, so that again a real 


solution of exponentially increasing magnitude may be formed. 
We should not let this fact induce us to require the constitutive equation 


| (2.3) to be such that Q(n) have only positive proper numbers for all directions n 


and all states of strain. Rather, it is possible that physical instabilities such as 


_ buckling observed for strains of a certain magnitude result from the inability 


of a material when strained to that degree to allow the passage of tiny waves, 


. especially transverse waves, without increasing their magnitude so much as to 


carry the body over into a state of small oscillation about a different configuration. 


6. Consequences of the principle of material indifference 


The theory studied so far is based on the constitutive equation (2.3) and 
hence is more general than elasticity, since it has not been required to satisfy 
the principle of material indifference [CFT, §239¢]. It includes, for example, 
MacCuLtacu’s theory of the rotationally elastic aether! [CFT, §302] and hence 
may be applied also in electromagnetic theory. As one of the many equivalent 
forms in which the effect of imposing the principle of material indifference may 
be put, we select the following specialization of (2.3) [CFT, Eq. (303.2) ]: 


The = 9g P*(0) (6.1) 


where 


eet (6.2) 


and where C is Green’s deformation tensor [CFT, §26]. Hence from (2.3) 


Hy == S57 eae (6.3) 


2 Haves & RIVLIN [1961, 2] call such a homogeneous strain “‘inherently unstable’. 
1 Nowhere in the foregoing sections has Caucuy’s second law of motion, t#”™=1"*, 
been imposed. In terms of the response function h, it is equivalent to the additional 


condition hee rm = hme ah a, (B) 


as follows from (2.3) and Eq. (A) in footnote 3, p. 266. In MacCuLLacu’s theory, 
on the other hand, ‘*”™=-—7”*. In the present ordering of ideas, CAucHy’s second 
law is imposed here along with the principle of material indifference, the two together 


being expressed by (6.1) and (6.2). 
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Since C,s=C;5,, there is no loss in generality if we suppose t** to be formally 
symmetrized in the components Cys. Thus we may put (2.5) into the form 
or @C,s 
i koe Ox” 5 


Age aint" + Seg xf 


(6.4) 
(Ghee 
= ye + 2825 8mq ee x4 6 OCg5 
Let us set : 
POY ae: p a 
CONN ee ee aC, 5 (6.5) 
Then 
Cred CmhPr = CVmir- (6.6) 
these relations of symmetry reflect the principle of material indifference. 
If we now set 
Bam d eH Bg BB ALS, (6.7) 
so that | 
Alam? =~ Ber Sms Xp Kg BPM, (6.8) 


then from (6.4) and (6.5) we see that? 
Brmea ome ya aide le Chmeg. (6.9) 

Hence by (6.6) it follows that 

Brmea en Bre = gh png _ gm eke 


Bima __ BEM ooh E Pt = ph TEN (6.10) 
Bimer_ _mkap grrgne jad ae fee. 


These identities, again, reflect the principle of material indifference. 


In the classical infinitesimal theory, the tensors A,,(*°, B’"?%, and C*™?2 
all reduce to the tensor of linear elasticities. 


Substituting (6.8) into (2.8) yields an expression for the acoustical tensor 
in terms of B*™??; 


km __ @ k 
QU Se a ik RN” eee 


© (58 84 + 8 6%) BEY" n,n 


(6.11) 


q? 


Q 
— kp 
ron B Pitty Ng 
Another consequence of the principle of material indifference is the fact that 
the speeds of propagation are determined by the strain alone, independently of the 
local rotation. This is most easily seen by demonstrating an alternative form of 


the propagation condition (2.7), expressed in terms of material rather than 
spatial co-ordinates. Set 


ee OP Satin Vy = x" Ny. (6.12) | 


* This result was given by Toupin & BERNSTEIN [1961, 1, Eq. (2.23)] for the} 
case of hyperelastic materials. 
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0 from (6.11) and (6.9) we see that 
] 
S% == Xg BAY Brin, My X” 6, 
er Se Fie Phe tare Te Ny 0 

Q 

: nyt? (6.13) 
a a 

Au 

| = es. 
= ee Aes 

DiSie8 ies OF BP ate Cy ple Chae 6.1 
) 3” °(C) $a Stlecy a anen (6.14) 


‘Thus, for a given material, D%,”° is a given function of € and hence is independent 
of the local rotation. From ©. 13) and (6.12) we see that the propagation con- 
dition (2.7) may be written in the form 


(S% — 6 U2 6%) A®=0, (6.15) 


| 
where we may‘ choose A in such a way that 

, AP — RB, ak (6.16) 
R being the local rotation. The speeds of propagation are determined from (6.15) 
by the cubic equation det (S%, — 6 U2.4%) =0 (6.17) 
and hence are algebraic functions of the principal invariants of the tensor S(C, v). 
The right proper vectors of S may be calculated from those of @ by means of 
(6.16); thus the acoustic axes may be obtained by effecting upon the right proper 


vectors of S the same rotation as that carrying the principal axes of strain at 
-X into those at 2. 


7. Isotropic materials 


For an isotropic material the function t(C) is an isotropic function. By using 
the representation theorem for such functions it is easy to show that (6.1) may 
be reduced?! to the form 


theif +h Bi, +te BEB Bes (7.1) 


where? ed Re 
Bem a PO ae eg. (7.2) 


and where the response coefficients /,, f;, fg are functions of the principal invari- 
ants J, I, III of B. Caucuy’s first law of motion (2.4) may be written in the 


3 This analysis generalizes and simplifies that of Toupin & BERNSTEIN [1961, 1, §4]. 


It employs the identity 
ler wf, xP 51”? (C), (C) 


which follows from (6.1) and Eq. (A) in footnote 3, p. 266. The result is equivalent 
to one stated and proved in a different way, again for the hyperelastic case, by 
Torotri [1943, §1, p.1]: The form of the quadric of polarization is independent 
of the local rotation. 

4 This possibility follows by decomposing the deformation into a rotation and a 
pure stretch; cf. CFT, §37. 

1 F.g., §303 of CFT. 

2 B is the tensor denoted by e in §29 of CFT. 
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a 


more familiar form [CFT, Eq. (205.2)] 


i” tof=or, (7.3); 
where 0 is the density in the deformed material. By putting (7.1) into (7.3); 
we see that? IREP, Be ae of a 0 k*, (7.4) 
4 
eG RM Clank Lo TN eee pen 
ih Pa Ree 7 qb? 


= 3 f, (55 09’ + 9 65) + 
+2, (0; BY + 6 BY + bf BY + 67B5) + 


km fo Ofo E B ) Las ofa 2. Z (7-3) 
8 pe Boat ary 4 pa — Poa) alll. 24 
mie of eh op 
+ Bt bog + arr Spa — Boa) + IU srry Bia] + 
rm| Ofs Ofs ofp 
+ BEB ™ [2h g,,+ Sle (1 b5g— Byq) + Il ir Boal, 
and, by (7.2), Bet va Xe ee? (ae eae #4 ab 5). (7.6) } 


If we calculate the jump of (7.4) at a singular surface of second order, by} 
using (7.6) and (2.1) we again obtain the propagation condition (2.7), where Q} 
now has the following explicit form: 


On = 22 TH yg BU hy M. (7.7) } 


By (7.5), T*?nq is a function of B only; hence, for given m, the acoustical! 
tensor Q is a function of B only. Therefore, as is only natural, 7m an isotropic: 
medium not only the speeds of propagation but also the acoustic axes for waves: 
travelling in a given direction are determined by the strain alone. 


A wave travelling down a principal axis of stress and strain in an isotropic: 
medium may be called a principal wave. The results to be expected in this: 
special case have been described in §1. To render explicit the statements made : 

=! 
there, let m, be a proper vector of B, so that B’ n§=vi ni, Bo n{=nj/vz, where 2, , 
is the stretch in the direction of the principal axis corresponding to n,. Substituting: 
(7.5) into (7.7) yields at once the following explicit form for the acoustical tensor®: | 


a vi" Q*,, (1) ae (fi + vi fe) On =e le BE+ 


2 
+ 2 mh f+ ob fet oe 
r=0 
* The approach here is that introduced by ErtcKsEN [1953, 1] for more special 
circumstances. 

* To effect the calculation, use Eqs. App. (38.16) of CFT. 

* In effect, this tensor may be calculated (in the hyperelastic case) by combining 
Eqs. (4.8), (4.7), and (3.10) of the paper by Haves & Rivirn [1961, 2], but, perhaps 
hindered by an exuberance of formalism, they did not state the simple and conclusive 
results derived here. Earlier Tororri [1943, Eqs. (18’) (19’)] had virtually calculated 
Q(n) for isotropic hyperelastic materials but, again apparently because of obscurity 


of notation and method, did not conclude anything except the independence of the 
acoustic axes from the local rotation. 


(7.8) 


ofr 2 a, Ofr 2,9 Ofr 
ap + at 8) arr + %2%8 Sry 
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} Since the remaining principal axes of B are normal to n,, we see by inspection 
of (7.8) that they are also principal axes of Q@(n,); that is, the acoustic axes for 
principal waves coincide with the principal axes. In particular, every principal 
| wave is either longitudinal or transverse. 


| By using (2.10) we may now calculate the speeds of propagation exactly and 
, explicitly®. Setting a=n,, we obtain the squared speed U;;, of longitudinal waves 
along the principal axis in which the stretch is v,: 


9 UY, 
DOr 


Ofr , ,2,2 Ofr 
arr | 2°83 errr |? 


=fr+20tfe + Soa sp + (ea +3) 


(7.9) 


while for transverse waves with amplitude parallel to n, the squared speed U4 
/ is given by 


| iat (vi + 03) fe. (7.10) 


| The formulae (7.9) and (7.10) give the explicit and exact speeds of propagation 
| of principal waves. ERICKSEN’ has pointed out that they assume a simpler 
form if expressed in terms of the relations giving the principal stresses ¢, as 
_ functions of the principal stretches: 


= fot fut fer, (7.11) 
' as follows from (7.1). Since 


| Often, Ohm 
| aut al 


! (v2 v2) Chitanprah ene int (7.12) 


oe 4) 


from (7.11) we see that the right-hand side of (7.9) is equal to d¢,/v}. Hence 


ot 


Beigel < Tete, 
0 Uy dlog v, ) (7.13) 
while from (7.10) it follows at once that 
U; hotly 
tem (7.14) 


Since these elegant formulae do not make direct use of the response coefficients fp, 
they give insight into the general nature of wave propagation in all kinds of 
elastic materials. 


6 These results generalize a long sequence of special formulae for various kinds 
of materials in various circumstances. Some of the relevant literature, which is 
mainly physical, is cited by Toupin & BERNSTEIN [19614, 1]. 

The formula (7.10) is similar to one given by ERICKSEN [1953, 1, Eq. (4.6)] for 
incompressible hyperelastic materials. The results above, however, do not seem to 
be obvious from his, because the condition of incompressibility by itself renders 
all waves transversal, and thus for incompressible materials the whole picture of 
wave propagation is not really representative of the general case. 

?7 Letter dated July 13, 1961. 

8 Here ¢, is regarded as a function of the three independent variables log v,, log vy 
and log v3. For special families of deformations a result of different form will follow. 
For example, if only deformations such that v;=v,=v3 are considered, we obtain 
a result of the form (8.4) below. 
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By using (7.10) we see that 
£” (U2 — Urs) = (v3 — 28) fe. (7.45) 


vt | 
From this formula follows the fundamental theorem on transverse principal 
waves: In an isotropic elastic material such that f2=O0 for the strain im question, 
both kinds of transverse waves that may travel down a given principal axis of strain 
have the same absolute speed of propagation, if, however, fg==0, then these two 
kinds of transverse waves travel at the same absolute speed if and only if the cor- 
responding principal stretches are equal; and in this latter case, any transverse 
amplitude is possible, and all these transverse waves have the same absolute speeds 
of propagation. From (7.15), moreover, we may interpret the sign of the function /g. 
Excluding the case when v,=v3, we see that if fp<0, transverse waves with 
amplitudes parallel to the axis of lesser transverse stretch travel at greater absolute 
speed than the others; if fg > 0, at lesser. 

From (7.9) it is clear that Uj, = Ujz if v; =v,. A converse condition is immediate 
from (7.13), leading to the fundamental theorem on longitudinal principal 
waves: In an isotropic elastic material, the squared speeds of longitudinal waves — 
in the directions of two equal principal stretches are equal; conversely, if Uy,= Ugo, 
it 1s necessary that! Ot, Ot, 


(7.16) 


élogyv, d@logv, ~ 
These results show that the state of affairs in the linearized theory, where all 


transverse waves travel at one absolute speed and all longitudinal waves at . 
another, is not at all typical for elastic materials. 


8. Isotropic materials subject to hydrostatic pressure 


Only the case of hydrostatic stress partakes of the simplicity to which the 
linearized theory has accustomed us. Indeed, if v, =v,=v3=v, then every direc- 
tion is a principal axis of stress and strain, so that all waves are principal waves, 
while (7.9) and (7.10) show that Uj = U,,=U33= Uj, say, and U,=Un=--- = 
Usa= Uj, say. That is, in a state of hydrostatic pressure, longitudinal and trans- 
verse waves may propagate in any direction; at each point, all longitudinal waves 
have the same absolute speed, and all transverse waves have the same absolute speed. 


To calculate these speeds, we note that since J =3v?, IT=3v4, [JT =v%, it follows 
that ) 


0 0 ile 
ar par Parr a ae. B.A 
for the class of deformations being considered. Moreover, by (7.1), t= —p6%, 
where =(g). From (7.9) and (7.10) it is now easy to show that 
Ut ee 2v2 ap 
lI 3 0 (f+ v fe) te de ) (8.2) 
5 
Ut =~ (ti + 20? fe). (8.3) 


* For rubbers f,<0 according to the results of experiments. The sign of the more 
frequently used coefficient h_,;, which is introduced in §11, is the same as that of fe. 
*° By calculating Uj,—U}, from (7.9) it is a trivial matter to express this con- 


dition in terms of the response coefficients fr, but the resulting formula is too 
elaborate to be useful. 


* Recall the caution expressed in § 5, 
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By eliminating /;+2v?/, between these two results, we derive the following 
_not inelegant wniversal relation between the two wave speeds and the bulk 
modulus in any state of hydrostatic pressure: 


Uj= Fu + ee. (8.4) 
| Two special cases are already familiar. First, for an ideal fluid we have f;=f,.=0, 
_ while 7p =g (II) = — p(0), say. By (8.3), U, =0 in this case2, while (8.4) reduces 
to the classical formula for the speed of sound in fluids. Second, in the linearized 
_ theory of elasticity, (8.4) reduces to a trivial identity connecting the two squared 
_ wave-speeds, viz (A+ 2u)/0, w/e, with the bulk modulus, (A+ 2y)/o. 

It is reasonable to assume that df/do>0 in all circumstances. Thus from 


b (8.4) we see that 
* Up > sul (8.5) 
for any state of hydrostatic pressure in any elastic material. In particular, 
longitudinal waves always propagate faster than transverse waves, provided the 
_ latter can exist at all. Whether or not U, is real is a more delicate matter. 
| Some conditions of reality are discussed in §10. For the time being, however, 
we content ourselves with interpreting (8.4) as follows: I a state of hydrostatic 
stress, if transverse waves may propagate at all, so may longitudinal waves, and then 
_ dp 
Ue; 8.6 
I do (8.6) 
that 1s, if transverse waves may. propagate, the squared speed of longitudinal waves 
is always greater than the squared wave-speed according to the hydrodynamics of 
an ideal fluid with the same law of compression. If, however, Oe 0, then 
2. ap 
Therefore, if neither material transverse singularities nor transverse waves are 
possible, the squared speed of longitudinal waves is either less than that of 
sound waves in a fluid with the same law of compression, or is zero, or is 
imaginary. The case when Uj <0 cannot be excluded a priort. 
The foregoing results on the speeds of waves in elastic materials subject to 
great hydrostatic pressure may be not devoid of interest to seismologists. 


9. Determination of the stress-strain relations from wave speeds 


When the form of the stress-strain relations is known, formulae such as (7.9) 
and (7.10) give unique values for the squared speeds of the 9 possible kinds of 
principal waves. Conversely, if these 9 squared speeds are known as functions of 
the three principal stretches v,, as for example from the results of experiments, 
then the three response coefficients f,, f;, and f, are uniquely determined. 

Indeed, they are overdetermined. From (7.9) and (7.10) we may derive a 
sequence of universal relations connecting the wave speeds, relations which must 
hold in any isotropic elastic material regardless of the forms of fy, 7;, and fg. 


2 Thus transverse singular surfaces of second or higher order cannot propagate 
in an ideal fluid. Other remarks on this case are given in footnote 4, p. 281, below. 
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Only if these relations are satisfied can there exist functions tosttas and le such 2 
that (7.9) and (7.10) hold. Thus these universal relations serve as conditions of 
compatibility for measured values of the wave speeds: Unless these relations be | 
satisfied by the data, it is not consistent with the theory of isotropic elasticity?. 

Compatibility of transverse waves. From (7.10) we derive at once 2 sets of 3 
universal relations: 


2 2 2 2 
Uy Us, UZ Ces ae U3 2 Us i ae oe U; 3 (9 1) 
— = —— = —_— == Aen 4 
vi ie v3 ie v2 v2 
2 2 2 2 
1 (oe oe) 1 le es L 1 (<3 “18 
ae crue eee ie et Pere: ea Pe eee ee We 
vi —vz \ vj v3 vz—vj \ v3 U3 U3— Ug \ U3 vy (9 2) 


= P (Vy, Ve, U3) , 


where @ is a symmetric function of its arguments. Of these 6 conditions, the 
first 3 are independent, but from any 1 of the last 3 the other 2 may be derived 
by use of the first set, so in all there are 4 independent conditions. If these 
are satisfied, the functions /, and fz are uniquely determined by relations of the 
type (7.10). For example, 


<= os Uae Vas 
fze=0~7=e v3 (v3—v2) ” 
1 U, (fe (Of 
fiat] aes es oral: (3) 


In each case the second form follows from the first by use of the identities (9.1) 
and (9.2). From the form of the results we see that 7m an isotropic elastic material, 
the response coefficients f, and fy are uniquely determined by the speeds of the two 
kinds of transverse principal waves corresponding to any one principal axis. The 
experimenter willing to take compatibility for granted may content himself with 
measuring U; and U,4 as functions of v,, vs, v3. 

Compatibility of longitudinal waves. Let us assume that the conditions of 
compatibility for the transverse principal waves are satisfied, so that f, and fs 
are uniquely determined. We now consider the compatibility of the speeds of 
the 3 longitudinal waves, from which the single function fy) is to be determined. 

If we set 


2 
2 er| a 2, 0 ro 
fa 20h fa— Droit | SP + (oh + v9) 2 4 obs SIE], (0.4) 


‘Toupin & BERNSTEIN [1961, 1, §4] were the first to consider problems of this 
kind. They derived necessary and sufficient conditions for the linearized theory of 
hyperelastic materials. Earlier HucHres & KELLY [1953, 2, p. 1146], after obtaining 
more than sufficient formulae for determining the second-order elasticities of isotropic 
hyperelastic materials, had remarked that “any additional measurements give checks 
on the theory”’. 

The results I present above for the finite theory are sufficient only as regards the 
nine kinds of principal waves. If waves travelling in other directions are to be con- 
sidered, additional conditions of compatibility must be satisfied. 


| 
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, then because of (9. 3) we can regard the 3 quantities 4, as explicitly given 


. fee OPUS URN Ue ie Uae ay, v,, and we are to Fadl fy from 3 relations 
' of the type 


a) 
ey + (v3 + 03) ae + v3 U3 on (9.5) 


If the three principal stretches v, are distinct, we may solve (9.5) as a linear 
system, obtaining the following results: 


Oly A, vi (v3 — 03) + A, v3 (v3 —v?) + A, v§ (v3 — v3) eer 


G! eal (vj —03) (vg—v) (vg—v}) Prvettig 
Of ES A, vi (v3— v3) +A, v3 (vg—vj) +A, v3 (v i—V3) ees 
arl 02) (03-08) (08-02) Fe 


arr (0808) (0803) (08-03) 


(v3 
fo _ Ay(vb—08) +42 (03-08) +4, (4-08) _ pp 


The quantities B, are known functions of v,, v2, v3; they must be symmetric 
functions and hence equal to functions of J, JJ, III. In order that f/, may be 
determined from the system (9.6), the functions B, must satisfy the following 
conditions of integrability: 


eB, OB, 0B, eB OB, OB, (0.7) 
Onn vere * Oli Taetiodds? Olea Aols ' 
If these conditions are satisfied, then 
fo= [(B dI + B,dII + B,dII1). (9.8) 


A summary of these results is given by the compatibility theorem: Let the 
9 squared speeds of propagation of principal waves be known functions of the principal 
stretches. This data 1s consistent with the theory of isotropic elastic materials if and 
only tf: 

(a) The quantities yp, B,, By, By, as defined by (9.2), and (9.6), are symmetric 
functions of the principal stretches. 

(b) The 4 algebraic conditions (9.1) and (9.2), and the 3 differential conditions 
(9.7) are satisfied. 

If conditions (a) and (b) hold, the data determines a unique form for the stress- 
strain relations by means of (9.3) and (9.8). 


10. Conditions that the waves be real 

As yet, we have not taken up the question of whether the speeds and cor- 
responding amplitudes are real. Only real waves have physical significance. The 
results so far stated are carefully phrased so as to refer, for the most part, only 
to necessary conditions. In the generality maintained thus far, 2 7s possible 
that no waves can exist. For Q(n), being an arbitrary 3 x3 matrix, may have 
but one real right proper vector, and the corresponding proper number may be 
negative, leading to a purely imaginary value of U. 

Various inequalities have been proposed so as to insure that the response 
function },* in (2.3) describe a material not altogether unreasonable for physical 


Arch, Rational Mech. Anal., Vol. 8 20 


280 C. TRUESDELL: 


applications!. All of these turn out to have some bearing on the propagation 
of waves, and in this context we are now in a position to read off results more 
general than any heretofore known. ‘wil 

First, for the hyperelastic case HADAMARD [1903, p. 269] gave a definition 
of stability from which he proved that in a stable state all speeds of propagation 
are real. I shall not discuss this condition here? because it is not clear how to 
extend HapAMARD’s definition of stability to general elastic materials, when 
there need be no strain energy. 

In the hyperelastic case, however, HADAMARD’S criterion implies that the 
equations of equilibrium are strongly elliptic, and a condition to this end we 
can easily formulate and discuss in full generality from (2.6); namely, the S-£ 


condition is Aaa em A Ap 0 (10.1) 
for arbitrary vectors land A. From (2.8) we see that an equivalent condition is 
Qc m) (n) a ” > 0 (10.2) 


for arbitrary n and I. That is, the S-E condition is necessary and sufficient 
that the symmetric part of the acoustical tensor be positive-definite, for every direction 
of propagation. From (2.10) it thus follows that U?>0 if @ is real. That is, 
if the S-E condition holds, the squared speed of propagation corresponding to every 
veal acoustic axis 1s positive. In particular, since for given m there is at least one 
real amplitude a, the S-E condition implies that for any direction of propagation 
there exists at least one veal amplitude with positive speed. 

Since, however, the S-E condition restricts only the symmetric part of 
Q(n), it cannot guarantee that in general there are three real acoustic axes. 
In the important special case of principal waves in an isotropic material, we 
have exhibited three real axes. By the results in §6, then, the S-E condition implies 
that the squared speeds of all principal waves in an isotropic material are positive. 

The foregoing results, except for the last one, which refers to isotropic 
materials, are independent of the principle of material indifference. When this 
principle is satisfied, from (6.8) we see that (10.1) may be expressed in the follow- 


ing equivalent f : m 
g qulv en. orm Bre 1], a Lj e >0 (10.3) 
for arbitrary vectors ! and L. A formally similar but in fact different condition is 
BIA See gee O (10.4) 


* A review of this question, which I proposed in a lecture [1956] before the Gesell- 
schaft fiir angewandte Mathematik und Mechanik in Berlin in 1955, is given by 
TOUPIN [1961, 3]. 

2 It must be added, however, that the criticism by SIGNORINI and ToLotrTt [1943] 
entirely misses the point of HapAMARD’s work by omitting any mention of the 
question of stability. As shown in the text above, it is entirely obvious that for 
fully general stress-strain relations there need be no possibility of real waves. More- 
over, the very special stress-strain relations derived by Tororti [1943, Eq. (3)] do 
not exhaust the possible isotropic hyperelastic materials in which all speeds of propa- 
gation are real, for ToLorrr imposes at the outset the unduly restrictive additional 
condition that one acoustical axis be longitudinal for any direction of propagation, 
while transverse waves may have any perpendicular amplitude. As to be expected, 
he succeeds in showing that scarcely any materials are of this kind. 

Hapamarn’s analysis of stability was questioned by DuneEm [1905, Ch. III, § IV]. 
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| for all symmetric non-vanishing tensors §. For hyperelastic materials, a condition 
| of this type is equivalent to one proposed by CoLtemAN & Nor? on the basis 


of a compelling thermodynamic argument, supported by important corollaries 


( in statics. We shall call (10.4) the generalized C-N condition. Unlike the S-E 
) condition, it cannot be expressed entirely in terms of the acoustical tensor. 


It is difficult to draw general conclusions from (10.4). From examples? it 
is known that neither the generalized C-N condition nor the S-E condition 


i imples the other for all materials and all strains. In the special case when we 
put S,,=,,, by using (6.11) we derive at once the necessary condition 


Q*™ (n) 2, Ny, > 0- (10.5) 


In general, n is not a right proper vector of Q(n); if it is, from (10.5) and (2.10) 
| we see that the corresponding speed of propagation is real. That is, the generalized 


C-N condition insures that if a longitudinal amplitude is possible, the corresponding 


_ squared speed is positive. 


In the isotropic case, ERICKSEN’s formulae (7.13) and (7.14) make possible 
the statement of simple and immediate criteria, which may be summarized in the 
fundamental theorem on the speeds of principal waves: In an isotropic 
material the squared speeds of the principal longitudinal waves are positive if and only 
af each principal tension is an increasing function of the corresponding principal 
stretch when the other principal stretches are held constant; the squared speeds of 


the principal transverse waves are positive tf and only if the greater principal tension 


occurs always tn the direction of the greater principal stretch. 

As has already been shown, the S-E condition implies that all squared wave 
speeds are positive; hence it implies the two statical conditions just stated. The 
first of these is implied also by the generalized C-N condition; to verify this fact, 


3 For the connection with the work of CoLEMAN & NoLt [1959, 1], the reader is 
referred to the paper of Toupin & BERNSTEIN [1961, 1, §3]. As explained there, 
the requirements of Coreman & Nort allow equality to replace inequality in (10.4) 
for certain exceptional strains, but in the interest of simpler statements of theorems 
we do not consider these. 

4 TouPIN & BERNSTEIN [1960, 1, §3] construct a simple example of an isotropic 
hyperelastic solid which when subject to hydrostatic pressures within a certain range 
satisfies the C-N condition but does not satisfy the S-E condition. 

As they remark, an ideal fluid for which p>0 and dp/de@>0 serves as an example 
of a material in which the C-N condition is satisfied for all strains but the S-E 
condition for none. To verify this fact from the present point of view it suffices 
to note that fp=g (III) =—p(e), f:=fe=0, so that (7.7) yields 


Qe (n) =2 ; III ue n* ny,=6 a n* Ny. (D) 
As in all isotropic materials subject to hydrostatic pressure, any transverse amplitude 
is possible, but herve the corresponding speed of propagation is zevo. That is, in the 
notation of §8, U,=0. From (2.1) and (3.1) it follows that im an ideal fluid, the 
only possible transverse singular surfaces of order greater than F ave material SUV faces 
across which only certain purely spatial derivatives such as ae “Cie x apy etc. suffer 
discontinuities [cf. CFT, §190]. The C-N condition, as was shown in the researches 
of Coteman & NOLL, is sufficient that dp/dg>0 and hence that Uj >0, by (8.4). 
Since U; =0, the S-E condition can never be satisfied. Conversely, the condition 
that the squared speeds of longitudinal waves be positive, since it is equivalent to 
dp/de>0, is weaker than the C-N condition, which implies that p> 0 also. 


20* 
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we need only notice that, as was shown in §7 »a longitudinal amplitude is always 
possible for a principal wave in an isotropic material, so that the italicized | 
statement following (10.5) becomes applicable. Thus the generalized C-N con 
dition implies that the squared speeds of all principal longitudinal waves im an ° 
isotropic material are positwe’. bts 

The generalized C-N condition is not so intimately connected with trans- 
verse waves. The second statical condition in the last theorem, namely, that 
the greater principal tension occurs in the direction of the greater principal 
stretch, was proposed and explored by BAKER & ErIcKSEN [1954] and may be 
called the B-E condition®. Merely rephrasing a result already proved, we may 
assert that the B-E condition is necessary and sufficient that the squared speeds 
of all principal transverse waves be positive’. It is easy to show directly that 
for isotropic hyperelastic® materials the C-N condition implies the B-E condition 
if all principal tensions are positive. Hence the C-N condition implies that in a 
state of pure tension in an isotropic hyperelastic material, the squared speeds of 
all principal waves are positive. 


11. Waves according to the second-order theory of elasticity 
for isotropic materials * 
We now apply the foregoing results to the determination of second-order 
elastic moduli from the speeds of waves. To this end we write (7.1) in the form? 


25 5 
(ems bn hpi se hele, (44.1) 


where the coefficients hp(I, IJ, III) are related as follows to the coefficients fp 
used previously : 
ATs I 


fo=ho+ radi h_j, fa tate hee: le 


x 


=a h-1- (11.2) 


The classical tensor H for measuring finite strain is given by 
EG = 3 (Cz — 09). (11.3) 


* A treatment of this subject from a slightly different viewpoint will appear in 
the proceedings of the Symposium on Second-Order Effects in Elasticity, Plasticity 
and Fluid Dynamics, to be held at Haifa next Easter. 

° This theorem is due to Toupin [1961, 3, Th. XII]. While his argument goes 
along correct lines, the undetermined form for Q(n) he in effect assumes as starting 
point is not sufficiently general, as may be seen from the explicit formula (7.8), above. 

°In fact Baker & EricksEen allow inequality to be replaced by equality if 
Yg=v,, but for ease of stating the result we leave this possibility out of account. 

* Hence follows as a trivial corollary a result apparently first published, and 
only for hyperelastic materials, by Haves & Riviin [1961, 2]: The S-E condition 
implies the B-E condition. This fact is easily proved directly from the definitions, 
without reference to the theory of waves. 

The theorem in the text above is close to one proved by ERICKSEN [1953, 1, §4] 
for incompressible hyperelastic materials. 

* I believe this result holds for all isotropic elastic materials but have not yet 
succeeded in constructing a proof. 

‘For the purposes of the present paper it would be easier to remain with the | 


coefficients fp, but the coefficients hp are those employed in most of the modern 
researches on other problems in elasticity. 
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Hence the principal invariants of E are related to those of B by the following 
identities [CFT, Eqs. (31.5)]: 


Ip =3 (I — 3), T=3+ 21g, 
Ig =4 (II — 21 +3), Tejera ale, (11.4) 
Ip = $ (LIT — II +I —1), TIT =14+ 217+ 4], + 811g. 


Let the response coefficients hp be expanded in power series in the invariants 
“? hplu=erpo+epryIpt+epel p+ ap3lpt+::-. (11.5) 
The extensions 6, are defined in terms of the stretches v, as follows: 

V,=1+ 6,. (11.6) 


Our objective is to express the wave speeds approximately as linear functions 
of the 6,, with coeffcients determined by the «,, occurring in the assumed 
expansions (11.5). It turns out that in the results the wp, occur only in the 
following combinations?: 


Lae ee 
— == Oy = Og +07, +17, 
Mt 

1 == %19 — “19, 


Xz = Ape + Ae +72, 


(11.7) 
067 = C9 ACH aC 


p= Kp — M71) 
hg = 4a_79, 


where A and yu are the classical linear moduli. 

Let us introduce the notation = to indicate equality to within an error 
which is O(d{-+ 63-+.63) as 6j-+63+.63—>0. Let & be the linear approximation to 
the cubical dilation: 

O = 0,+ 0, + 05. (11.8) 
Then 
1 KO (11.9) 


From (7.10), by use of (11.5) and (11.7), we see that 
1 


a - 205) h =p / 5) 


OE es ue, 
© (4 264) hy 


t 
= (1+ 20) (470 + ee (1 — 20) (@_10 +0119), (14.40) 
= (1+ 26,) (%19 — 410) + (411 — & 11) B+ 24719 (5, + 45), 
=1+26,+ 4459+ $a (d, + 6,). 
‘eZ oo = o(1+9), (14.11) 
2 To the same order of approximation, the stress-strain relations are of the form 
tt w=, Tp Oh, + 2B + (tty pt oe Lg) Om, + Os Lig Bh, + 06 BG ER, (E) 


where E is the classical linear tensor of infinitesimal strain, but this fact is not used 
in the analysis. 
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we may express (11.10) in the alternative form 


bb 
giving the second-order speeds of transverse principal waves. 
It is not so easy to get the corresponding results for longitudinal waves. 
First we write (7.9) in the form 


=A+B, (11.13) 


a 
dew: + 2u5 fe, 
B= ye he a (v5 4 v3) ofr | veve ofr 


SIL (in waggle li 
Then from (11.2), (144.5) and (11.7) it follows that 


where 
(11.14) 


A 1 vi—vz—v3 
h | 2 3 
| Ay v2 v3 v3 hz 
—1— 20-40, 
1+2¢0 


= yg + 0%7,9+ 
=14+ 4459+ a 6. 


Also 
meh } 
B= “lo +3149 — 4) 2h + (1420-26) Ae 4 
+ (1428) [28 +2(14+0—6,) “2 + (1428 —26,) it|+ 
| efe Of, Of 
rate as tH 2(1+9—d) 2 +(1+28— 26,) Bade aa 
Ya] . 
é 
nace oY faa ® arr) Phir hea 
ese (or +233 all | arr) tr + 2f2). 
From (11. ie we see that 
2 VC Ee ese é é Lay <p 
or +2 aa © OLED Sooner al + Ot 4 Og: “147 
Hence hie becomes 


0 
2 big Wot h+ te) ++ (0 -4,)- Brig tot hit fe) Os sr a+ te). (14.48) 


Now by (11.2) and (11.4), 


B= 


lord Hoe = Ng hy oh oa SII Ir sae 
Hence WErSineeiiren vie - © (14.19) 


Ame + fa) os a ( 


= FW (0g, +01, + a_1;) + (Hog +%72+0_19)9, (11.20) 
=U (ee, +039). 


hg +h, +h_)), 


goa 4428, +(1-+ 5 a5) 8 +5 a6 (51 + 45). (11.42) | 


t 


(%_19 + %_179), (11.15) 
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Likewise 
O 5 
(9 — dy) OTe (fo + f1+ fe) = (8 — 4)- 5 te (%03 + 73+ %_73), 
(11.21) 
= (0 — 6) Fe X45 
and 
8, aig Uta + 2h) + 61 go (ts — ha), 
(11.22) 


= (071 — & 17) 0, = Fu 5 Oy. 
Putting (11.20), (11.24), and (11.22) into (11.18) yields 


B. 


hn 
2 


ay + A+ (as + Foy + tas] 0 4 ( Tog + ts 4 a) Ot: (11.23) 


By substituting (11.15) and (11.23) into (41.13) and then using (11.11) we obtain 
the definitive result for the second-order speeds of longitudinal principal waves: 
Opin. A 2h 


; pon (7 L 24 Qeet ay - a5) 0 ! e EAE Sy a gael 2016) dy. (11.24) 


The formulae (11.12) and (11.24), giving the speeds of propagation explicitly 
and generally for the second-order theory, include and generalize a long series 
of particular results of this kind. 


L. BRILLovIN [1925, Eqs. (76), (77)] [1938, Eqs. (XI.125), (X1I.126)] was the first 
to calculate the second-order wave speeds for a general hyperelastic material subject to 
uniform hydrostatic pressure. To reduce our results to forms corresponding to his, put 
Die 0, 0,=¢, Lhen Uj,—U3,;=—U;,=Uji, and Uj,=U3,= -°+=U3,=UF, and from 
(11.12) and (11.24) we see that? 


2 
* a -- SOS OS ACH eG Nese PC 
2 
pre eon, oe Se (11.25) 
1 ae a T 2 5 6° 


While L. Brittoutn [1925, §11] had calculated the second-order speeds of prin- 
cipal waves in a medium subject to uniaxial tension, apparently HuGHEs & KELLY 
[1953, 2, Eqs. (12)] were the first to obtain a sufficient number of independent relations 
to determine the second-order elasticities from wave speeds in hyperelastic materials. 
Their method of linearization is not altogether clear to me, and I have not attempted 
to check their results against mine. Toupin & BERNSTEIN [1961, 1, Eqs. (5.12) ] obtained 
formulae which are easier to assess. They considered a biaxial tension. To obtain 
results corresponding to theirs, put U},= vj, Ufz=Um, Ujs= v4, 0, = 0;= — 60, = — Ce, 


3 To get BRILLOUIN’s results, which presuppose a hyperelastic material, put 
@o,=—A+8A+424B, 
fp %=—16A, 
HM o&;=2(A—w+84), 


X= 4 (u+6C). 
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where o is to be held constant as e>0. Then from (41.12) and (11.24) we see that4 | 


2 2A 
Go A aes 20) (4 +2+20,+a,-+4 as} ol + 4— y+ %§ 4 2a), 
fe 


uw de lb 

Qo dm : be bod (11.26 

oh a =1 2 (1—20) «54 2 ( G) OX, ) 
2 

ORS (oe (1—20) a,—oa,. 


pw de 2 
Toupin & BERNSTEIN have developed alternative forms of these results, suitable 
for immediate comparison with experimental data. 

Let us now suppose that the 9 squared wave speeds UZ, are known, possibly from 
experimental data, as linear functions of the extensions 6,. We shall find necessary 
and sufficient conditions that these linear expressions be compatible with second- 
order elasticity for some value of the coefficients ap. First, the general con- 
ditions (9.1) and (9.2) for the transverse waves may be replaced by the following 
second-order approximations : 

ie Uy, Uys, — Us’s ~~ UF, i Us es U2 
0, — 02 - 02— 03 a 0; — 03 Fie 


TOS, a Ut a (oi ey ee ee 
a 2 “by bs 


where U/ is the squared speed of transverse waves according to the linearized 
theory. Alternatively, these conditions may easily be derived directly from 


(44.270 f 


(11.12). When they are satisfied, unique values for «; and a are determined 


from the squared speeds of the two kinds of transverse waves corresponding 
to any one principal axis. E.g., 


einige tener 

SG. Q? 

2 (6,6) U7 

atk cr (1.28) 
55 = [28 — 1 (28 — 4) ag + See — 4] 4, 


For the longitudinal waves it is easier to procede directly from (11.24). In 
this way one sees that the three differential conditions (9.7) are equivalent in 
second-order approximation to a single algebraic one: 


(UA — UP) (62 — 85) + (UR, — UP) (55 — 64) + (U3; — UP) (6, — 59) =0, (41.29) 


where Uj is the squared speed of longitudinal waves according to the linearized 
theory. If this condition is satisfied, and if x; and ag are previously specified by 
(11.28) or otherwise, unique values for « and «, are determined from the squared 
speeds of any two kinds of longitudinal waves. E.g., 


(Ur': — Ui?) 62— (Ue — UP) 6 


cam UZ, — UP 3 rs 
o : O1 22 11 324 
3 2v(d,—6,) U? + 2(5.—0y) U? 2 ( ] 7 Ks XE, = al 
o, = — Uy, — U*, “ | 9) (2 | 4) os ! re 3 
(0,—0,) UP Lt . 


* To get Toupin & BERNSTEIN’s results, which presuppose a hyperelastic material, 
H%=—A+, +3, 
H &%=—2Y,, 
He tg=2(A— yoy), 
HM O%e=4 (+95) . 


put 


(G) 
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In any theory in which the squared speeds emerge as linear functions of the 
extensions, no satisfactory conditions for the speeds to be real can result, for no 
| linear function is positive-definite. Since the speeds cannot be non-constant 
) linear functions of the extensions according to any exact stress-strain relation, 
this result does not indicate any defect of the general theory. The second-order 
theory is only an approximation; the resulting formulae are expected to be 
valid only for small extensions; and for sufficiently small extensions, the second- 
j order wave speeds are always real if the first-order ones are. Thus no restriction 
on the coefficients can be found by considering the second-order wave speeds. 
Restrictions may result by applying the various conditions of §10 to the third- 
q order theory of elasticity, but I will not enter this forbidding avenue of research. 
_ The formulae and analysis of this section when compared to those preceding 


= an example of the frequent observation that an exact and general theory 


“is often simpler and easier to understand than an approximate one or the result 
of a perturbation process. It is possible, of course, but less enlightening to 
calculate the second-order wave speeds directly from the second-order stress- 
, strain relations, given as Eq. (E) in the footnote on p. 283. 


12. Hyperelastic materials 


| When there exists a strain energy 2'(x” ,, dy), the function §¥ in (2.3) assumes 
the form [CFT, Eq. (303.5)] 


| Bee Ps 
| Oy = ae (204i) 
From (2.5) we see that then , 
: Aes aa Ce Bi (12.2) 
Now by (2.8), 
Quem (2) cL Qing (12) > [Age aay A gt ag x! 5 Ny Ng: (12.3) 


Hence (12.2) implies that Q,,,=Q,,,; thus? the acoustical tensor of a hyperelastic 
material is symmetric; consequently all acoustic axes are real, and for each direction 
of propagation there exists at least one mutually orthogonal set of acoustic axes?. 


In the hyperelastic case, accordingly, much more specific results hold. When 
we apply the principle of material indifference, we may express all results in 


1 The condition is sufficient, not necessary. In order that Q;.(m)=Q,,,(m) for 
every n, it is necessary that 
Ay, ib A Pte Eek, (H) 


where 
Oa pe Lee (I) 


In view of the theorem of BERNSTEIN given below, in order to construct a non-trivial 
example satisfying Eqs. (H) and (I) it is necessary to consider materials violating 
the principle of material Hier eeees Since no linear constitutive equation can 
satisfy that principle, take },* ap a a, where ip 4, Const, then Ay, 0° = KpeyP 
and solutions of Eqs. (H) and a ee many. 

2 This theorem is due to HADAMARD [1901] [1903, 1, p. 268], generalizing earlier 
work of CHRISTOFFEL and HuGOoNIOT. 

HaDAMARD’S theorem was questioned by DuHEM [1903] [1906. Ch. I, §§ II, IV]; 
while I have not attempted to follow the long analysis of DunEm, I suspect that 
only some misunderstanding of notation is involved. 
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terms of the tensors B*”"?% and C*”?? of §6. From (6.7) and (12.2), then, 
Berri Beewn. (12.4) 


consequently, by (6.9), 


Eqs. (12.2), (12.4), and (12.5) are alternative necessary and sufficient conditions 


that an elastic material be hyperelastic. Whe 
BERNSTEIN2? has pointed out that when the principle of material indifference: 


is adopted, a definite and simple converse to the foregoing theorem comes intw 
force: If for every direction of propagation the acoustical tensor of an elastic materialr 
is symmetric, then the material is hyperelastic. For by (6.11) we see that Q*” (n) = 
Q”* (n) if and only if 

(Bhema — BmPE) nn =0. (12.5 A)! 


For this condition to hold for every m, it is necessary and sufficient that 


Brema_ Bmbka 1 BkamP __ Bmagkh _ g, (12.5 B): 


Hence 
Bekam__ Bakpm | Bemak __ Bambk _ G (12.5 @)h 


Adding these two equations yields 
TELIA LUTE SICH ES Brie _ Bimerk — Bmeka _ Berk Bikem _ Brym? (12.5 D) 


By four uses of the identity (6.10), we see that the right-hand side vanishes, 
while the left-hand side reduces to 2(B*?”"?— B*9™?). Hence (12.4) holds, 
showing that the material is hyperelastic. 


It would be useful to express the theorem and its converse directly in terms of! 
acoustical data, but this does not seem to be possible. Rephrasing a theorem oj: 
Kevin & Tait on matrices [CFT, App. §37], we see that if the squared speeds are: 
non-negative and if an orthogonal triad of acoustical axes exists, then the acoustical 
tensor is symmetric. From this fact and from BERNSTEIN’s theorem we may conclude: 
that if, in an elastic material, there exist three orthogonal acoustical axes for every. 
divection of propagation, and if the corresponding speeds of propagation ave real or 
purely imaginary, the material is hyperelastic. This result fails to be a true physical | 
converse to HaDAMARD’s theorem because purely imaginary speeds of propagation | 
do not seem to be distinguishable from complex speeds, since neither correspond to: 
real singular surfaces, while both lead to amplified infinitesimal oscillations. However, , 
it shows that the existence of orthogonal axes with real speeds of propagation for 
every direction is sufficient for an elastic material to be hyperelastic. Of course it 
is possible that in an elastic material which is not hyperelastic, the acoustical tensor: 
may be symmetric for some particular directions of propagation. For example, it 
follows at once from (7.8) that the acoustical tensor for any principal direction of 


stress in an isotropic material is always symmetric, whether or not there be a stored 
energy. 


[Note added in proof, September 9, 1961. The case last mentioned in small 
type is truly exceptional, for I have just discovered that if in an isotropic elastic 
material the acoustical tensor Q(n) is symmetric for one single direction n not lying 
im the plane of two principal directions, then the material is hyperelastic. To prove 
this result, we begin by writing out the components of T* “pq i a principal co- 


Eaabetrens dated July 28 and August 8, 1961. The paragraph beginning here 
and the following text in small type were added in proof, September 7, 1961. 
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ordinate system. Using (7.12), we see from (7.5) that 


Sn ese Bop (12.58) 
12 mY i eet, mee! 
'while the other types of components of 7*”,, vanish. Let the components of 
m be cos #,, cos J, cos 03; then from (7.7) we find that 


D t eS 
| Q3(n) = © an | 2 = Vi| Cos 9, Cos Do, (12.5.F) 
U3 


1 eo | dlogv, 
while Q{ is obtained by interchanging 1 and 2 in this formula. For a principal 
wave, cos #, cos 3, =0, so that Q3= Q{=0, as we have shown in §7 by a different 
argument. If, however, cos #, cos #,-+0, then Q0;=(Q} if and only if 
ot, Ot, 
) @logv, é@logv, 
In an earlier work [1952, Eq. (41.11),] I have shown that this equation and the 
similar ones obtained from it by permutations of indices are necessary and 
sufficient conditions for the existence of a stored energy. Q.E.D. 
If cos #;=0 but cos J, cos #,+-0, so that the wave is not a principal wave, 
but n lies in the plane of the principal directions corresponding to the principal 
_stretches v, and vy, then Q3=03=0?=(Q3—=0, but the single condition (12.5 G) 
‘is necessary and sufficient that Q03—(Q%. It is thus possible for special isotropic 
elastic but not hyperelastic materials to have symmetric acoustical tensors for 
all directions of propagation lying in one and only one of the three planes defined 
by the pairs of principal axes. 

Therefore, there are three possible kinds of isotropic materials as far as 
acoustical properties are concerned: 


=t,—t. (12.5 G) 


1. Those for which Q(n) is symmetric for the principal waves only. 

2. Those for which Q(n) is symmetric for the principal waves and also for 
any perpendicular to one and only one of the principal axes. 

3. Those for which Q(n) is symmetric for all nm, these last constituting the 
hyperelastic materials. | 

Three further definite advances over the results of the more general theory 
may now be made. First, if we set 


prt BPre, (12.6) 
from (12.4) it follows that P is symmetric. Also, if for a unit vector m we set 
m(n) = PP? n,n,, (12.7) 

from (6.11) and (2.9) we see that 
m(n) = Qn )=0 Us (12.8) 


where 6 U2? is the a‘* proper number of Q(n). eC P is real and symmetric, 
there are at least three mutually orthogonal directions m in which m(n), as 
given by (12.7), is extreme. By (12.8), then, 7 a hyperelastic material there 
exist at each point at least three mutually orthogonal directions n, such that the sum 
of the squares of the speeds of propagation of the possible waves travelling in the 
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direction n, is an extremum’. In the foregoing statement, the possibility tha 
some wave speeds are purely imaginary is not excluded. . 

Second, as is clear from (9.2), the S-E condition is necessary and sufficiena 
that all squared wave speeds be positive. In particular, if the S-E condition js 
satisfied, there can be no material singular surfaces. 

Third, the C-N condition ensures that at least one wave may travel in the direc— 
tion of n, for any n. This result was discovered by Nori’ and was proved as 
follows by ErIcKsEN®. The proper numbers of Q(m) are the extrema of the 
bounded continuous function 


FD) =O" (0) Ldn, (12.9) 
where I is a unit vector. By (10.5), F(m)>0. Hence at least one extreme of) 
F(l) must be positive; that is, @(n) has at least one positive proper number,* 


so that the corresponding wave speed is real. 
For isotropic materials it is easy to show® that the response coefficients f;; 


in (7.1) have the following special forms: 


14 Wy @2 1 Oo 4.08. Pas 4 Ports 1 Oe, ; 
ato yi CL eae hh ar tir’ 2 5 fe all’ (12.10), 
where 2'(I, II, III) is the strain energy. Hence 
oy @ OR Seae: O2n nig et ‘ 
Accordingly, conditions of integrability for 2’ are | 
O (te Oth fo\ __ 
ar (a) arto iy ei | 
1 Ofo Oh fe __ 
eyrir el” OrIT pedi le “24a 


1 hy 4 8 (it) =o 
oVilr oll ' alli \e 


By use of (9.3) and (9.6) we may express these conditions directly in terms of the: 
wave speeds: 
a Uy, — U}’s | i “ev [8 U?, — v3 U's =.) 
ol | v?(v3—v?) oll ve (vz — v2) : 
1 o | vgU2,—v3 U2 
Lewes) ¢ QU en ESS Oh ee, 
nat Bol aoa | =o 
1 é U}’s —U) 
pViir * | aLIT |v} (v}—v3) 


(12.13) 


|=0. 


We presume that all the conditions of compatibility obtained in §9 are satisfied; 
then, in particular, B, and B, and the two expressions in brackets in (11.13) 


* This result generalizes slightly a theorem of Pastor [1949, §6]. It is obvious 


3 
2 : ‘ 
that g2U; (mn), being a bounded continuous real function of n, has a maximum 
= i 


and a minimum unless it is constant. It is the existence of the third extremum 
that does not seem evident a priori. 

* Letter dated September 13, 1960. 

Letter dated June 16, 1961. In a letter of July 14, 1961, Nort informed me | 
that this proof is the same as his and disclosed a sharper inequality for F(t) | 

°* The results used here are equivalent to those given by TRUESDELL [1952 
Eq. (41.5)] and may be derived by a similar procedure. 
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‘are symmetric functions of the v, and hence are equal to functions of J, IJ, II. 
By use of the conditions of Gata one moreover, A,, Az, and Ag, as defined 
‘by (9.4), are expressible in terms of U2,, U2 3, Uy, Us, Us, their first derivatives, 
‘and v,, v2, v3; thus B, and By, as defined by (9.6); 2, are functions of these same 
quantities. The results obtained may be summarized as follows: for an isotropic 
‘elastic material, let the three possible squared wave-speeds corresponding to one 
‘principal axis of stress and strain as well as the remaining two longitudinal wave- 
speeds be given as functions of the principal stretches, then the material is hyper- 
elastic if and only if the three differential conditions (12.13) are satisfied. 

_ The conditions (12.13), while expressed directly in terms of the speeds of 
principal waves, are elaborate in form and difficult to interpret. BERNSTEIN’S 
condition, namely, that Q(n) shall be symmetric for every mn, is simple but 
does not seem to lend itself to interpretation in terms of acoustical data alone. 
It assures us that an elastic material in which three waves of mutually orthogonal 
amplitudes may propagate in any direction must have a stored energy, but 
this is not a characterizing condition. In respect to propagation of waves, there 
seems to be no especially simple property by which the existence or non-existence 
of a stored energy manifests itself without fail. 

In the second-order theory, however, a simple identity, suitable for experi- 
mental test, replaces the elaborate general conditions (12.13). Either from (12.13), 
or from the formulae (E), (F), or (G) given in the footnotes on pp. 283, 285, 
and 286, respectively, it is easy to see that for the existence of a stored energy 
-in the second-order theory of isotropic materials it is necessary and sufficient 

that the 6 first-order and second-order elasticities be reduced to 5 by the follow- 
ing equation: 


ag +05 =2 (2 — 1). (12.14) 


This condition may be expressed directly in terms of the wave speeds by means 
f (11.28) and (11.29), but the resulting identity is complicated. An equivalent 
but simpler formula may be obtained by differentiation of (11.12) and (11.24). 
Since 


2 
Qo OU, Goings Soars 
we Oo, 2 ds 
a i a )UA= 2 + 4 — hg + Hs + 2%, 

1 2 


we see that (12.14) is equivalent to the following universal criterion for a 
stored energy in the second-order theory: 


2 
Ee 8d, 5) Un - oe gees ee) 


This condition affords the first opportunity, so far as I know, for a clear-cut 
and unequivocal experimental test of the existence of a stored energy in an 
isotropic material’. It is different from the previously derived conditions of 


7 In the linearized theory, every isotropic elastic material is hyperelastic. In 
the finite theory, this is not so; while practically all previous researches on the finite 
theory concern hyperelastic materials, I know of no case in which the existence of 
a stored energy is necessary for the results obtained. In fact, many of the solutions 
derived by the British school could be expressed more simply in terms of the coetfic- 
ients hp, independently of whether there be a stored energy. 
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compatibility in that a connects a longitudinal-wave speed with a transverse-wave « 
speed. There are in fact 6 conditions of this kind, but, on the presumption that | 
the general identities (11.27) and (11.29) are satisfied, any of them implies = 
the rest. In this sense (12.16) is necessary and sufficient for the existence of ay 
second-order stored energy. If the partial derivatives are evaluated in the natura! ] 
state, (12.16) is also a necessary but not sufficient condition for the existence = 
of a stored energy according to the general theory. 

In order to test (12.16) properly it is necessary to let 6,, 6, and 6; vary inde-- 
pendently. It is impossible, for example, to find a relation corresponding to (12.16) } 
by considering a one-parameter family of deformations such as that used by ToOupPIN ¢ 
& BERNSTEIN and explained in §11. It may be preferable, however, to reduce the: 
number of independent variables from three to two. For example, if we employ / 
only deformations such that 6,= 46;, then it is easy to show that instead of (12.16) ) 
we have the alternative necessary and sufficient condition 


a 2 | 
oe +6U?=0, (12.17) } 
06, 


a) DN XG 


2 
oom a5; ) i 


but 6, and 6, must be allowed to vary independently. 


13. Duhem’s theorems on the effect of temperature variation 


In general the stress-strain relation (2.3) involves an additional scalar para- - 
meter such as the temperature, #, or the specific entropy, 7: | 


i= oe d4,7) = We (5s d,,0), (13.4) } 


where f and § are different functions. The results obtained so far are valid! 
when 7 = const. or when # = const., but the tensors A, defined by (2.5), and the : 
acoustical tensors, Q, defined by (2.7), are different in the two cases and may * 
be distinguished by the notations A, A, 0, O. 


As far as I can learn, the only treatment of more general conditions, when ' 
temperature and entropy may vary within the body, is that given by DuHEM! 
[1903, 2] [1906, Ch. I, §2]. I have despaired at following his long calculations, , 
made more difficult from an unfortunate choice of variables, but here I present - 
my own treatment of the same problems, leading to results which seem to be: 
equivalent to his. 


First, if we substitute (13.1) into (2.4), we obtain equations of motion of the 
following forms: 


Al (aB) ..m ane m abe am ~ ~ os 
Bem P04 Cane , dn hm at Of, =O Xp, 


(13.2) 
ONR m | 


A, (8) 4” oe Obg 9 m ~ pt wes 
km eae) aa” ra teed a a0 sm & wOdp =O Me. 


Taking the jump of these equations at a singular surface leads to propagation 


conditions of the form (2.7) not only if ™ = const. or if #= const. but also 
under the weaker assumptions 


[ml =0, [8 n] =0, (13.3) 


respectively. We now consider physical circumstances such as to render valid 
one or the other of the relations (13.3). 
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First, we review a classical argument, given in effect by Fourter [cf. CFT, 
}296|, which shows that if [9] =0 and if Fourter’s law of heat conduction is 
issumed, then (13.3), follows. For by MAxweEL’s theorem [CFT, Eq. (175.8)], 
he condition [8] =0 implies that [9 ,] = B,. Fourter’s law of heat conduction 
isserts that the flux of energy, h, depends linearly on i thats, hie Uys 
vhere K is positive-definite. Hence [h,,]=BK,?n,, where K is assumed con- 
inuous, and consequently n - [h] = BK,? n,n”. But the principle of conservation 
« energy requires that n- [h] =0, provided the internal energy be continuous. 
dence B=0, whence (13.3), follows. 

_ The foregoing argument has nothing to do with elasticity, but we may sum- 
narize its implication in an elastic context by the following theorem on elastic 
conductors: In an elastic material subject to FOURIER’S law of heat conduction, 
he laws of propagation for waves of second order across which the temperature, 
internal energy, and heat conductivity are continuous are the same as those of 
2omothermal waves. Similarly, if we consider a wave of third order across which 
5, 0,,K,?, and K,? , are assumed continuous, from the differential form of the 
equation of energy we conclude that [0 ,,,]=0, whence it follows that the laws 
of propagation are again those of homothermal waves. Parallel assumptions 
ead to a parallel theorem for waves of higher order. 

~ Second, assume that there is no conduction of heat at all: h=0. Then! 
7=0, and consequently [7] =0. If it is assumed that [7] =0, then by HADAMARD’S 
conditions [CFT, Eq. (180.5)] we infer that (13.3), holds. Thus we have the 
following theorem on elastic non-conductors?: in an elastic material that does 
not conduct heat, the laws of propagation of acceleration waves are the same as for 
the homentropic case. Parallel assumptions lead to a parallel theorem for waves 
of higher order. 


We may relate the tensors A and A if we assume an equation of state for the 
temperature: etree 
f 8 =S(0" 5, d4,7). (13.4) 


Then from (13.1) and (2.5) we see that 
one 2d 


fet bend wee, 
Ay SSA pay a ae Clap 


(13.5) 


While this result is not illuminating in general, for hyperelastic materials it can 
be put into a form having immediate interpretation. In the hyperelastic case 
there is an equation of state for the internal energy, 


SES Neen Diag) es (13.6) 
while ~ aé aS 
Ka = O = 0 — es eee 4 . 
Ty’ = by =6 ant’ b=0 = (13.7) 
nig . se 
[cf CFT, §256A]. Then 56 , binge 


(13.8) 


1 For hyperelastic materials, this may be proved [c/. CFT, §256A]; for more 
general elastic materials, it is offered as a hypothesis. 

2 The special case of this theorem for gas dynamics is familiar: The speed of 
sound for isentropic flow is the same as for a homentropic flow at the same thermo- 
dynamic state. 
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But by substituting (13.7); into (13.1). and differentiating with respect to i 
we see that b a 
Of?  P 06 8 OHnf 


Soh = at; (13.07 
én 0d On x OO 


where x is the specific heat at constant deformation. Substituting (13.9) inte 
(13.5) yields a fundamental relation between the homentropic and homothermaz 
elasticities of any hyperelastic material: 

0 aby OB 
x6 00 80 


A pnt? = Ange? 4 (13.106 


Using (2.8), we derive DunEm’s fundamental relation between the two: 


acoustical tensors: ~ = Oo } 
Quem = Qamt ry Fi Em (13.117 

where raed 
D ae A x? Np. (13.420 


Thus the properties of homentropic waves may be calculated directly from the 
stress-strain relations in which temperature is the thermodynamic parameter: 
Moreover, the tensor P, P,, is always non-negative definite, for any . There¢ 
fore, since #/% =0, it follows that if Q(n) is positive-definite, so is Q(n). Thus: 
as remarked by DuHEM, if, for a given hyperelastic material subject to given dex 
formation, three independent waves may propagate in the direction n if } = const.: 
then the same holds if y = const. In other words, if the homothermal wave speedé 
are real, so are the homentropic wave speeds, but the converse need not hold: 


14. Remarks on other researches 


In this paper I have not attempted to summarize everything known regarding 
waves in severely deformed elastic materials!, but some topics should be listec 
for reference. | 

1. The connection between HADAMARD’s stability and wave propagation: 
already mentioned. 

2. JOUGUET’s presentation of the general shock conditions in an elastic contex? 
and analysis of weak shocks [41920, 1—4] [1921]. 

3. L. BRILLouIn’s elaborate study of radiation pressure and other second! 
order effects [1925] [1938, Ch. XI, §§ VII—XVI]. | 

4. ERICKSEN’S analysis [1953, 1] of waves in isotropic incompressible hypert 
elastic materials. By combining the approach of EricKsEN with that presentec 
in §§ 2—6 of this paper, a complete theory of waves in general incompressibld 
elastic materials is easily constructed. Such waves are always transverse [CFT’ 
§191]. For isotropic materials the wave speeds are given by (7.10); the relation 
(7.15), the conditions of compatibility (9.1) and (9.2), and the determination 
of the stress-strain relations from the wave speeds by means of (9.3) remain 
valid but may be simplified a little since vjvjvg3=1. The second-order theory 
for a general isotropic incompressible material may be read off from ERICKSEN’s 


a Such a summary will appear in the article by W. Nott & C. TRUESDELL, Thé 
Non-linear Field Theories of Mechanics, Friiccr’s Handbuch der Physik, Vol. 8} 
part III, in preparation. 
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‘exact theory for a Mooney material, since such a material serves as a general 
second- order approximation. In addition to the 4 conditions of compatibility 
(41.27), there is 1 more: 
Uj, + UP, —2U? UP, — Us 
| b, d.—0, 


AU ee (14.1) 


‘Thus any one speed determines all possible speeds; furthermore, the single 
second-order elasticity is also determined by any one wave speed; in fact, it 
mcs to consider a one-parameter family of deformations. As is well known, 
“every isotropic incompressible elastic material is also hyperelastic as far as the 
second-order theory is concerned. 


5. Toupin & BERNSTEIN’S conditions of compatibility for waves according 
to the linearized theory of anisotropic materials [1961, 1, §4]. These are easily 
generalized, in slightly modified form, to the case when there need be no stored 
energy. 


I am happy to acknowledge the support of the U. S. National Science Foundation 
‘through a fellowship at Bologna for the period during which this paper was written. 
‘It is a pleasure to express my gratitude also to Messrs. BERNSTEIN, ERICKSEN, NOLL, 
:and Toupin for their criticism and help. 


Note added in proof, September 19, 1961. Shock relations in hyperelastic materials 
»are derived and discussed qualitatively by J. Dewry: Strong shocks and stress-strain 
relations in solids. Part I. Ballistic Research Laboratories Report No. 1074, Aberdeen 
“Proving Grounds, Maryland, April, 1959. Plane infinitesimal oscillations in a bi- 
axially stretched isotropic hyperelastic material that conducts heat according to thermo- 
elastic constitutive equations, 


S=Z(B,9), h=K-V9, 


where K is an isotropic function of B and V®%, have been studied by J.N. Fravin& 
-A.E. Green: J. Mech. Phys. Solids 9, 179—190 (1961). Except in limiting cases, 
the contents of these papers does not overlap the research presented above. 

Not included in the list of earlier works cited by Toupin & BERNSTEIN are the follow- 
ing papers on special theories of waves in stressed materials: Bircu, F.: The effect 
of pressure upon the elastic parameters of isotropic solids, according to MURNAGHAN’S 
theory of finite strain. J. Appl. Phys. 9, 279—288 (1938). — Biot, M.A.: The influence 
of initial stress on elastic waves. J. Appl. Phys. 11, 522—530 (1940). 

The somewhat perplexing conclusions of these authors regarding waves in bodies 
‘subject to great hydrostatic pressure are cleared and unified by the exact theory 
given in §8 of the foregoing paper. 
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Small Deformations Superposed on Large Deformations 


{ 


| in Materials with Fading Memory 


A.C. PIpKIn & R. S. RIVLIN 


1. Introduction 


: The non-linear stress-deformation relation for materials with memory has 
been discussed by GREEN & RIvLIN [J] and GREEN, Riviin & SPENCER [2]. 
In the present paper we derive the restricted form applicable to small dynamic 
deformations superposed on large static deformations. Although this can be 
| done by making an appropriate linearization of the final result of the non-linear 
theory, we avoid some of the complexities of that theory by introducing the 
linearization at an earlier stage. The results obtained will apply in particular 
_to problems of wave propagation in finitely deformed viscoelastic bodies. 


It is assumed that the stress depends on the deformation history, and that 
the properties of the material do not change with time. The stress components 
“are then hereditary functionals of the deformation gradients. Because rigid 

rotation of a deformed body has no effect on stress, other than to rotate the 
stress field, and since the stress-deformation relation must embody this fact, 
the relation is of a certain restricted form. This form was derived by GREEN & 
RIVLIN [7] by using invariant-theoretic methods, and subsequently No tt [3] 
derived an equivalent form. Nott’s derivation is simpler than that of GREEN & 
-RIVLIN in that it does not make use of invariant theory, but the form he derives 
is not suitable for the purposes of the present paper. In Section 3 we take the 
opportunity to present a derivation which combines some of the simplicity of 
NoLi’s method with the explicitness of GREEN & RIVLIN’s result. 

When the deformation consists of an infinitesimal displacement field super- 
imposed on a finite deformation, the stress-deformation relation can be linearized 
with respect to the gradients of the infinitesimal displacement field (Section 4). 
In a material with fading memory, if the finite deformation is held fixed long 
enough so that the original deforming process has no further effect, the non- 
linear functionals specifying the stress can be expressed in terms of non-linear 
functions and linear functionals (Sections 5 and 6). 

The functions appearing in the linearized formulation are subject to restric- 
tions in form if the material has any symmetries in its undeformed state (Sec- 
tion 7). In the case of isotropic materials (Section 8), these restrictions are such 
that the constitutive equation can be brought into a simpler form involving 
the classical strain tensor (Section 9). 

21% 
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2. Materials with memory and hereditary functionals 


Consider a body in its undeformed state. Let the coordinates of a generics 
particle be X;, in a fixed Cartesian coordinate system x. If the body is deformed; 
as time progresses, then the particle which was originally at X; moves to a news 
position x, at time /, and the deformation may be specified by giving 4%; as a: 


function of X, and ¢: SM (24) 


The derivatives 0%,/0X,, called deformation gradients, may be used as measures 
of the deformation in the neighborhood of the particle X,. 

Suppose that the stress at a given particle depends on the deformationr 
gradients at that particle, not only at the instant ¢ considered but also at alll 
previous times. The stress components o;, at time ¢ are then functionals of thee 
deformation gradients at the particle considered. To express this idea, we write 

co 
07; (2) a fi; [dX ( aes t] . (2.2) 
33 

Explicit dependence of the functionals on ¢ indicates that the intrinsic pro-- 
perties of the material may change as time progresses, whether the material. 
is subjected to a deformation or not. This possibility will immediately bes 
rejected, however, for we assume that the material has the following property: 
if two deformation histories differ only by a time shift, then the corresponding = 
stress histories differ only by that same time shift. It is easy to see, and cor-- 
respondingly easy to show rigorously, that this assumption implies that the: 
functionals cannot depend on ¢ explicitly. Thus in place of Eq. (2.2) we write: 


o,,(0) =fislOxp lt — DIX). 23)! 


The functionals are then said to be hereditary. 


3. Rotation of the physical system | 
Not every relation of the form (2.3) is physically meaningful. When an: 
arbitrary rigid rotation is superimposed on a given deformation history, the: 
stress field undergoes an equal rigid rotation. The form of the constitutive : 
equation must be such that this condition is satisfied. | 
Let x; be a deformation history obtained by superimposing a rigid rotation: 

on the deformation »;(t), so that 


%,(X»,t) =a; ;(t) x; (X,, 2). (3.4) 

Here the coefficients a;;() must satisfy the orthogonality conditions | 
Gin (t) ain (0) = ag s(t) aj) =6;;, | a,;()| =1. (3.2) 

From Eq. (3.1), the deformation gradients for the new deformation are 
0%; (t)/OX» = a, ;(t) Ox, (t)/OX,. (3.3) 


D 


Letting @; ;(¢) denote the stress components associated with the deformation %; (é), 
Eq. (2.3) implies that 


O;; () = f,;[0%, (¢ — 1)/0X,]. (3.4) ) 


t=0 
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__ The condition to be satisfied is that the components 6; ;(¢) are related to the 
|}components o;;(t) by the rotation a;,(¢). Thus, 


6; ; (t) = 4;, (t) aj1(t) Or ()- (3.5) 


By using Eqs. (2.3) and (3.4) in Eq. (3.5), and making use of Eqs. (3.2) and (3.3), 
we obtain 

| FijlOxp(t Se = y(t) a1; () fel, (¢ — 7) wis (¢—7)/eX,]. (3.6) 
‘The functionals 7;; must be of such form that Eq. (3.6) is satisfied no matter 
what the choice of a;;(t) may be, subject to the restrictions (3.2). 

_ We may in particular let a;,,(t) be the 7-component of vector number 7 in an 
, orthogonal system, formed from the three vectors 0x;/0X,, 0x,;/OX,, and 0x,/0X3 
by the Schmidt orthogonalization process. In the first step of this process, 
.4,;(¢) is constructed by dividing 0x;(¢)/0X, by its length. We thus obtain 


a, ;(t) = [Gi] 0%; (t)/OX,. (3.7) 


| Here G,,(¢) is one of the set of inner products G,,(¢), called strain components, 
which are defined by ax,(t) Ox; (t) 
: Gyq (t) cs Lexie 0X, =F (3.8) 
Next, we construct the unit vector in the plane of 0x;/0X, and 0x;/0X,, per- 
-pendicular to 0x;/0X, and forming a positive inner product with 6x;/0X,. Using 
‘the notation 4,,(¢) for the cofactor of G,,(¢) in the determinant G (#)}=|G,,(4)|, 
this vector may be written 


Ay ; (t) = [Gy 1 (2) As3(t)]~? — Gyo (t) OX, (t)/OX, + Gy, (2) OX; (f)/OX,]. (3.9) 


Finally, a; ;(¢) is taken to be the unit vector perpendicular to 6x,/0X, and 6x,/0X, 
whose inner product with 0x;/¢X3 is positive. Thus, 

i Ox; (t) 0x; (t) 

. pe 4 hard ie nore o/s 

as; (t) =[A33(#) G(¢)]~* | 413 4) aX, + Ap (%) aX, + As; (2) aXe Whi (3.10) 


We note that G,,(t), 43 (¢), and G(¢) are positive in any deformation possible 
in a real material. Equations (3.7), (3.9) and (3.10) may be abbreviated as 


a; ;(t) = Cin (t) Ox; ()/OX,, (3.41) 


where the coefficients C;,(¢) are functions of the strain components G,, (é). 


Since the coefficients a, ,(¢) constructed in the manner described above satisfy 
the restrictions (3.2), Eq. (3.6) must be satisfied for this choice of a;;(¢). Thus, 
if Eq. (3.11) is used in Eq. (3.6), and Eq. (3.8) is used to simplify the expression 
for the functional argument, we obtain 

0 dx;(t) axj(t) , - eB 

fagl@xp(t — 212%] = Com) Gee Cin Gg furl Cpr = 2) Grglt — 2] 


¥ 10 
0x; (t) ax; (t) 
OXm OX, 


(3.12) 


EynlGpall—1)] (629). 


The second member of this equality is of the form shown in the last member 
because the coefficients C;,(é) are functions of the strain components Gy, (?). 
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It is thus seen that if Eq. (3.6) is satisfied, the functionals f;; can be written 1 
in the form (3.12). It is easy to show that, conversely, if the functionals are » 
of the form (3.12), then Eq. (3.6) is satisfied for all orthogonal transformations. , 
The constitutive equation is therefore of the restricted form 


; 0 
0,0) = a0) HO) FG p4l¢— I]. (3.43): 


We note that since the stress tensor is symmetric, f,; =,. 


4, Small deformations superimposed on large deformations 
Consider a deformation obtained by superimposing a displacement field | 
éu;(X»,t) on the basic deformation *;= %; (X,, #): 


xi (X,,t) = x; (Xp, t) +e u,(Xp,0). (4.4) 
The stress components corresponding to this deformation are, from Eq. (3.13), 


oi) = te | Flea — t)], (4.2) , 


where the strain components G;,(¢) are those obtained by replacing ~;(¢) by 
x¥(t) in Eq. (3.8). We wish to obtain the linearized form of Eq. (4.2), valid _ 
for small enough values of «. | 
By using Eq. (4.1) and neglecting terms of order ¢?, we obtain 

OxF(t) Oxf (t) _ Ax,(t) Ox; (t) du;(t) 0x; (t) dx; (t) eu; (t) (4.3) 

Jo NER = Okeke. || eons axe COR, We i 
The strain components G;,(t) appropriate to the new history can be found by 
setting 7=7 in Eq. (4.3). Thus, 

#1 (t) = Gpi(f) + 2€ Ey, (2). (4.4) 


Here the functions G,,(¢) are the strain components for the basic deformation | 
x, (¢), and the functions E,,(¢) are defined by 


_ Ouj(t) 0x; (t) 0x; (t) Ou, (t) 
2£,,(2) OX, OX at OX peewee (4.5) 
We now assume that the functional F,,; is particularly dependent on the 
value of Gj,(¢—t) and its time derivatives at r=0, and is a continuous func- 
tional of G3, (¢—t) over the range 0<t<oo. Using the notation 


Ga =Gp,(t), GR = a" GE, (/at", (4.6) 


we suppose that /,, may be expressed as the sum of a nee of terms, each 
of which is the product of a differentiable function f,, say, of GE (w=0,.1,2)080m 
and a continuous functional ®,, say, of a (¢—t) over hes range 0< 1% =< og 
Then, 
FulGh,(— 2] = Lhl Gq") ®,[GS, (¢—7)). (4.7) 
tT=0 


Using Taylor’s theorem, we have, with (4.4) and the neglect of terms of order 
higher than the first in e, 


R(GsY ) = f, (GY a) +26 DEB Of/OGF acihis (4.8) 
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‘Tn (4.8) we use the notation 
i 

EM = Ea), Ee = d" EB, (t)/dt". (4.9) 
_ By using the integral representation for continuous functionals, it is seen 
that neglecting terms of order ¢?, the functional ®, may be expressed in the form 


[o,0) 


: ®,1h,(t — t)] = ©, (G,,(¢—1)] + ef PD) [Gl —&);t] E,,(¢—t)dt. (410) 
| = t=0 =—() 


By using (4.8) and (4.10) in (4.7), and again neglecting terms of order <2, 
we see that F,, is expressible in the form 


Fi [Gp. a(t — t)]=4i1G pal = 

Er i & 

| +e DAR [Gog (é — 0); Ce] Be () + (4.41) 
7T=0 

| Si a Fairs 1G og’ — 8)5G Hp T] E, .(¢ acon t) d T. 


The constitutive equation for small deformations superimposed on large 
deformations is now obtained by using Eggs. (4.3) and (4.11) in (4.2), again 
neglecting terms of order ¢?. Since ¢ was introduced only to facilitate the lineari- 
zation process, it will be omitted after this step; 7.e. we take «=1 with the 
stipulation that the displacement gradients and their derivatives are small. 
The linearized equation is then 


__ [ Ox; (t) 0%; (2) du;(t) Ox; (t) 0x,(t) Ou; (t) pee 
Bs3\) =| ax, ax; amy 20x, ax, oxy [Pee — 
dx; (t) x; (t) 4 a y 
+ Gx ax, piece t); Gf) EM) + (4.12) 


4 (t) au Se. d 
Ae oe bf Kun Cog — EN Gee) T. 
Since the stress tensor is symmetric, then F,,=—F,, Airs= Abr s and 
Me 75 Arers- once E“ —E), there is no loss of generality in taking AY =A 
and K,;,;=Kgi5, as well. 


sr? 


5. Materials with fading memory 


If the basic deformation x;(X,,¢) were time-independent, the functionals 
F,;, AY,,, and K;;,,; in Eq. (4.12) would reduce to ordinary functions of the 
constant strain components G,,. The basic deformation is, of course, necessarily 
time-dependent, because it must involve a transition from the undeformed state 
to whatever fixed state is contemplated. However, the same simplification is 
achieved if a material with fading memory is held in a fixed state of deformation 


for so long that the original deforming process is forgotten. 
Let the basic deformation x;(X,, ¢) in Eq. (4.1) be one in which the body 
remains in a fixed state x;(X,) after time ¢=0; 1.¢. 


x;(X»,t)=%,(X,),  #>0. (5.4) 
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Using the notation G,,(¢) for the strain histories, as before, let G,, denote the: 
constant strains associated with the fixed deformation x;(X,). Since Gp, (t)=G,, ¢ 
for {> 0, then the derivatives Gi) (w=-0) in Eq. (4.12) all vanish. ; 

We assume that after a sufficient amount of time has elapsed, the defor- 
mation history before ¢=0 has an arbitrarily small effect on the stress. This: 
assumption requires that the following condition be satisfied by the functionals s 
F,, which appear in Eq. (4.2): 


t 


lim {F,,[G,,(¢— t) + 2E,,(¢—1)] =F; [Gpg+ DE ple t)]}=0.  (5.2)) 
t—>0co t=0 7 
As a particular case of Eq. (5.2), we have 
Jim Fj[Gpq(¢— 2)] =FilGpq] = Fis Goo) (629). (5.3)) 


As indicated, the functionals reduce to functions of the constant values G,, in 
this case. By considering the linearized form of Eq. (5.2), 1.e. the form cor-- 
responding to Eq. (4.12), it can be shown that the functionals A!”,, and Kj;,;, 
also reduce to functions of G,, at large times. 
When the deformation has been held fixed long enough that the history / 
before =0 has no perceptible effect, the functionals F;;, A!?,,, and K;;,; may’ 
be replaced by their limiting forms, which are functions. The constitutive = 
equation (4.12) then assumes the form | 
; . : ; x, @Ou(t 
oi) =[aey oxy tak, aay + axe aay (Te God + 


(5.4) | 


OW, «0X; F 7 > 
he aX, OX > AP) AG ra) E) (?) + f Ksirs(Gpos0) Pik lte- T) adt\. 
v 0 i 


Here x; denotes the basic steady deformation, with associated strain compo-- 

nents Gy,. | 

In passing, we note that if u,; is steady, then the combination x;-+-u; can be 

regarded as a new steady deformation with no unsteady part superimposed. . 

The stress given by Eq. (5.4) must be the same, whichever way the deformation ! 

is regarded. From this consideration, we find after some manipulation that the ‘ 

following relation must be satisfied: 

AGL (Gog) + f KejnslGpq. 1) dx = 2 PO (5.5) 

0 

In the remainder of this paper we will consider only those materials and_ 
deformations for which Eq. (5.4) is valid. 


6. An alternative approach to materials with fading memory 


In Sections 2 to 5 we have developed the constitutive equation for small 
dynamic deformations superimposed on large steady deformations in materials. 
with fading memory. This was done by specializing the constitutive equation 
for materials with an unspecified type of memory. An alternative approach is 
possible, in which it is assumed initially that the deformation consists of a steady 
deformation x;(X,) with a displacement field u;(X,, t) superimposed on it. It 
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_ is then assumed that the stress components depend on the deformation gradients 


0x,/0X, and the history of the displacement gradients du, (t)/0X,, so that 
6; ; (2) = fislOupt ¢ — t)/OX,; 0x,/0X,]. (6.1) 


To find the restricted forms of /;; which will satisfy the rotation invariance 
principle, we consider a new total detonation x, (¢t)=x,+4u,(t) related to the 
original total deformation x; (t)=*;-+u;(¢) by a rigid rotation, 7.e. by Eq. (3.1). 
The stress components corresponding to the new deformation are given by 


&;;(t) =f; (Oi, (t — 1)/OX,; O%,/0X,]. (6.2) 


The new stress components 6; ;(¢) must then be related to the components o;, (2) 
by the transformation (3.5). By using Eqs. (6.1) and (6.2) in Eq. (3.5), and 


_ making use of Eq. (3.2), we obtain 


[e.e} 


UN ag a t)/OXq; Oxp/OX 4] = ay ;(t) 40) fal ip t= 2) 0X,; 0x,/0X,]. (6.3) 

At this St neither x; nor %;(¢) has been defined. Only their sum, %;(¢), 
has been defined by (3.1). The new total deformation %;(¢) can be decomposed 
into a steady part and an unsteady part in either of two essentially different 
ways: (i) the steady part %; may be regarded as identical with the steady part +; 


_ of the original deformation, in which case the entire rigid motion is included in 
the unsteady part w;(t), (ii) the new steady part 7; may be related to x; by a 
time-independent rigid rotation, with w;(t) related to w;(t) by the same rotation. 


In case (i), we take ¥;=x;. The displacement field 7;(¢) is then defined by 
U4; (t) =%,(t) — %; = a,,(t) «;(t) — %;. (6.4) 
Equation (6.3) then becomes 
15; Ou, (¢ — t)/OX,; Ox,/0X,] 
t=0 


oo (6.5) 
= a, ;(t) %;(¢) fril@pr(¢ — 7) oot ST) OX sg ON |G gs 0%,(0X4] . 


By constructing the coefficients a@;;(¢) from the gradients 0x;,(t)/oX;, as in 
Section 3, we find that /;; must be expressible in the form 


Ox; (t) Ox; ( 5 
yes 1s — ) Re [Gpalt — a 7); 0% OX | - (6.6) 


When Eq. (6.6) is used in the basic invariance requirement (6.3), and Eqs. (3.1) 
and (3.2) are used to simplify the resulting expression, it is found that the 
functionals F;* must satisfy the condition 


FS [Gpal — 2) éx,[0X,] =F *[Gpalt— 1) 0%,[0X,], (6.7) 


where the strain components Gy, (t) are defined by replacing x;(t) by %,(¢) in 
Eq. (3.8). By making use of Eqs. (3.1) and (3.2), it is found that 


Gyo (t) = Gog (t)- (6.8) 
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We now consider case (ii). The coefficients a;; in Eqs. (3.1) and (3.2) are 
taken to be constants, and the new steady deformation x; is defined by 


Ej = aj; %). (6.9) | 


CS ae 
By using Eqs. (6.8) and (6.9) in Eq. (6.7), we find that the functionals &* must 
be of such form that 
F*[G,q(¢ — t); 0%,/0X,] = iF de dg T); Ap, Ox,/OX,). (6.10) 
t=0 t= 
By constructing the coefficients a; ; from the steady deformation gradients 0x,/0X, 
in the manner described in Section 3, we find that the functionals #;* depend 
on the gradients éx,/2X, only through the strain components G,, defined in 
terms of them. By using this result in Eq. (6.6), and the result so obtained in 
Eq. (6.1), we obtain finally 


oi) = ae sat FuilGoglt — 1); Goal: (6.11) 

The displacement gradients du, (¢)/0X; enter into Eq. (6.11) through 0x; (t)/6X; 

and G,,(t—t). In general, the displacements w;(t — t) cannot be small throughout 

the entire history of deformation, since x;+u,(¢— 1) =X; for t sufficiently large. 

Thus, before Eq. (6.11) can be linearized with respect to the displacement gradients 

it is necessary to assume that the material has a fading memory and that enough 

time has elapsed so that the deforming process has been forgotten. With these 

assumptions, Eq. (6.11) can be linearized, and the form obtained is again given 
by Eq. (5.4). 

7. Symmetry of the material 
Certain restrictions must be placed on the functions F,;, A‘,,, and K;;,, 


: eit ake yA 

in (5.4), if the constitutive equation is to describe a material which has some 
symmetry in its undeformed state. Let % be a fixed rectangular Cartesian co- 
ordinate system, related to the system x by an orthogonal transformation, so 


that 
where 


Sik Sip = Spi Shi = O5;- (7.2) 


Defining Ce Er and E®) () as in Egs. (3.8), (4.5), and (4.9) respectively, 
but in terms of the new coordinates ¥; and X;, and making use of Eqs..(74) 
and (7.2), we obtain 


Gy 5 = S54 S51 Gpp, Ev) (t) = S548, EO). (7.3) 


The components G;;() of stress, measured with respect to the system %, are 
given by 


al 


ijt) = S5p Sj, 0p) (2). (7.4) 


Tf $=|s,;l| is a symmetry transformation for the material, then Eq. (5.4) 
remains valid if all quantities are measured with respect to the system x instead 
of the system x. By using Eq. (5.4) and the corresponding expression for G; ; (2) 
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_ to eliminate o,;(t) and @;,(¢) from Eq. (7.4), a relation involving F,,, AW. and 
K;;,; is obtained. From that relation and Eqs. (7.1) to (7.3), it can be shown 
| after some manipulation that the following conditions must be satisfied (for 
details of a similar manipulation in the isotropic case, see [1] or [2]): 


| Fi (Gpq) = Six 851 F41(G q)» (7.5) 

j AY), «(Gpq) EO (0) = Sin 8; AL), (Coq) EMO), (7.6) 

es i | 
K (Gog; T) EP st = T) = Siz S51 Bigs (Gey; T) E, .(¢ i T) 2 (7.7) 


| These relations must be satisfied identically for each transformation s belonging 
, to the group of symmetries of the material. In the following section we will 
: consider the consequences of Eqs. (7.5) to (7.7) in the case of isotropic materials. 


8. Isotropic materials 


The symmetry group for an isotropic material is either the full orthogonal 
_ group or the proper orthogonal group, depending on whether the material has 
/a center of symmetry or not. Presence or absence of a center of symmetry is 
irrelevant in the present context, however, because Eqs. (7.5) to (7.7) are satis- 
fied identically if the transformation is the central inversion, 7.e. if s, j= — 0;;. 
RiIviin & ERICKSEN [4] have studied the implications of an equation of the 
_ form (7.5). They have shown that if the functions F;; are single-valued functions 
of the variables G,,, and if Eq. (7.5) is satisfied for every orthogonal trans- 
formation s, then the matrix F=|F,,|| can be expressed in terms of the matrix 
| G=|G;,| in the form 

. 


iil 

F=RI+FG+E G*. (8.1) 

Here I is the unit matrix |6;; The scalar coefficients &, F, and F, are single- 
valued functions of three basic orthogonal invariants of G, 

GG; 


4%? 


trG?@=—G,,G;;, trG?=G;,,;G;, G,;. (8.2) 
If, in (7.5), F;; is a polynomial* in its arguments, then F may be expressed in 
the form (8.1), with /j, /,, and J expressible as polynomials in the invariants (8.2). 

The limitations on K;;,, E,,(—t) implied by (7.7) may be obtained in more 
explicit form by considerations of the type used by Riviin & ERICKSEN [4] 
and by Riviin [5]. We use the notation 


Kj; (Gpq, dys) = K 557s (Gpq) ER, (8.3) 


the dependence of E,, on ¢—t being understood. Then, Eq. (7.7) may be written 
in the form La Sd 
K AG sas LH, 3) = Siz Sita (Gx, Es): (8.4) 


If we assume that K;;,, is a polynomial in G,,, so that K;,; is a polynomial 
in G,, and £,,, linear in E,,, it follows (Riviin [5]) that K=|K;,,| may be 


*x We note that if Fj; is a continuous function of G,,, it may be expressed as a 
polynomial as closely as desired in any closed domain. 
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expressed in the form 
K=K,E+K,(EG+ GE) + K,(E G? + G’E) + 


- > Key ltt Ea. 
M,N=0 
The coefficients Ky, K,, K, and Kyy are polynomials in the invariants (8.2), 
and functions of t. If these coefficients are regarded as single-valued functions 
rather than polynomials, Eq. (8.4) is still satisfied. 
Since Eq. (7.6) is of the same form as Eq. (7.7), its implications are analogous 
to those of (7.7). We use the notation 


AY) (Gyq, EM) = AY, .(Gpq) Eft ()- (8.6) 


(8.5) | 


avs VS 
By an argument similar to that used in deriving (8.5), we find that A® =||A?)] 
can be expressed in the form 
A” = Aw) BE + A®) (E” G+GE)+ 


2 8.7 
+ AY) (BE G2 + G2 EB") + Abr (tr BO G*) qu SA 


The coefficients A’, A?), Af?, and A$), are polynomials in the invariants (8.2). 
Again, the invariance requirements are still satisfied if these coefficients are 
regarded as single-valued functions rather than polynomials. 

When Eq. (8.5) is used in (8.3), (8.7) is used in (8.6), and (8.1), (8.3), and (8.6) 


are then used in Eq. (5.4), the constitutive equation for isotropic materials is | 


obtained. It may be written as 


_ | 0%; Ox; | Ouj(t) Ox; | Ox, Ouj(t) 
o;;(t) lax. xe OX, OG books ax] [Fo Ons + Gui t+ Fe G5 Gi] + 


Wax MEN 4p 9: 51g + AD? (Bap Gig + 19 Gep) + 


2 
a Ay’ (Ox Gi, Gyq Ae O1q Cry G,p) 5 oo Avy (G") x1 (G) 94 Ey () + (8.8) 


M,N=0 P 
OX; Ox; 


ae OX, OX) [Ko (t) Oxp Org = ky (t) (xp Gig ats Org Grp) te 
0 


2 
+K0) (kp Gir Gra O14 Gur Grp) + & Kun t) (Gas (@™) pg} Epg(t—0) dt 
M,N=0 


Dependence of the scalar coefficients on tr GY (N=4, 2, 3) is understood. 


9. An alternative form for the constitutive equation for isotropic materials 
Equation (8.8) can be written in a slightly simpler form. As the first step, 
the displacement gradients are written in terms of spatial derivatives, thus 
(0U;(t) > Ou;(t) Ox, 


OX; OX, Ox; g (9.1) 


Here the derivatives 0x,/0X; are deformation gradients for the basic steady 
deformation. By using (9.1) in the definition (4.5) of E,,, we obtain 


OX; Ox; 
Ep, (t) nice aX, e:;(t), (9.2) 
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_ where e;; is the classical strain tensor defined by 


4 [ 0u,(t) 0u; (t) 


| ent) a5 ax; Ox; | (9:3) 
| The time-derivatives 1B are, from (9.2), 
Ci, On, 
(v) ES a9. . 
q EQN) = se oe PO), (9.4) 
where 
| el) (t) = de, (t)/dt. (9.5) 


In the definition (4.9) of a and now in Eq. (9.5), the differentiation d/dt 

| holds X, fixed. Thus d a duj(t) 2 9.6) 

dt Ota au laa ONE : 

' When this expression is used in (9.5), terms arise which are quadratic in the 
displacements. These will be neglected. Thus, in (9.5), d/dt may be regarded 

| as the partial derivative with x; held fixed. 


Next, consider the tensor g= 


Suis ax, aN (9.7) 


By using this definition with Eqs. (9.2), (9.4), and the definition (3.8) of Gy 
_ we obtain the relations 


h 


te GY tron. tr (BO GY) = trie! g"*), (9.8) 
Se Be (Om = ODay (0.9) 
Sie aa, EG) = (ge g**),, (9.40) 

Ox, OX; Ou;(t) Ox; 1 Ox; Ou; (t) (G*),, 
OX, aX) aX, OX OX, ox, (9.11) 


as [in O;1 F Ui (t) 6 jl si Uj; i(t ) O:%] (City Pere 
In the last equation, the notation du,/dx;=wu, ; has been used. 


By making use of Eqs. (9.8) to (9.11), the constitutive equation (8.8) can be 
written in the form 


6; ; (¢) = [6;, 6;, + %:,4(4) Oj. +414 On (og +hgt+hg*)..+ 
= 2 ae ge"g+ APgeg?+ greg) + 


2 
Sede (Ge Suc G ie LG) ce Auettle Gy) Gace 
M,N=0 


a7 


+f [Kol t)geg+K,(t) (geg?+g%eg) + 


2 
+K, (2) geg? +g%eg) + D Kun(e) trleg’) 9) dr. 
The time-dependence of e and e”) is understood. According to (9.8), the scalar 
coefficients in (9.12) may be regarded as functions of trg® (N=1, 2, 3) rather 


than of tr GY (N=1, 2, 3). 
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The integration with respect to time in (9.12) is integration with Xp held J 
fixed. However, with neglect of terms quadratic in the displacements, 1t may) 
be regarded as an integration holding *; fixed. ; 

By using the Cayley-Hamilton theorem, g*? may be expressed as a linear! 
combination of g2,g, and JI, with coefficients which are polynomials in the: 
invariants tr gY (V=1, 2, 3). Therefore we may write 


Kg thet hea=hilt+hgt+hg’, (9.13) } 


where fy, f;, and f, are functions of tr g¥ (V=1, 2, 3). 

Denoting the integrand in Eq. (9.12) by k,; and its matrix by k=|k,,|, we » 
see that k is a symmetric matrix polynomial in the symmetric matrices g and e¢, , 
linear in e. The scalar coefficients are, of course, not necessarily polynomials. . 
According to RIvLIN [5], k may therefore be expressed in a form similar to (8.5). . 
In that expression we replace K by k, G by g and E by e to get the desired | 
result. Similarly, the summand in (9.12) can be expressed in a form analogous | 
to Eq. (8.7), by replacing G by g and E by e in the latter expression. These : 
results, taken with Eq. (9.13), imply that Eq. (9.12) can be reduced to the 
following form: 


05; (t) = [8x 0;2 + Ui,4 4) Oj. + 4; 1 Orn) FoI t+hg +h glk + 
+2, as? bin Oj. + a! (Oin 81 + 91 Bin) + 


| 


I 


+ al (Bin Sjn Snr + 9j1 Sin Sua) + 
2 

see ally (9™):;(9™)ai ef} (t) + (9.14) . 

M,N=0 | 


+ J |Po(e) 8:0 8js + Aa (0) (Bin 851 + 82828) + 
+ ky (t) (zx Simtel e 651 Satay 


By 


ee 2X Pun () 9") :(9™ uy €,1(¢ — t) dt. 


The scalar coefficients are functions of tr g’ (N=1, 2, 3). 
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A Generalised Rayleigh Quotient Iteration 
for Lambda-Matrices 


P, LANCASTER 


Communicated by A. OSTROWSKI 


1. Introduction 


The purpose of this paper is to present a new method for calculating the 
latent roots and vectors of a matrix * whose elements are polynomials in a scalar, J. 
We refer to such a matrix as a A-matrix, and consider the typical form: 


D,(A) = Ay A+ Aya t+.» +A, A+ 4), (1) 


where the A, (y=1, 2, ...,/) are matrices of the same order with real or complex 


elements. The latent roots, 4;, and the right and left latent vectors q;, 7; are 


defined so that ; 
elined so a D, (A,) q,= Oo and D, (A,) r= O, (2) 


where D; denotes the transpose of D). . 

We define a regulary A-matrix as a square A-matrix, of order 1 say, for which 
Ay is non-singular. It is easily seen that such a A-matrix has /m latent roots 
if they are counted according to their multiplicities. 

We next define a simple j-matrix as a regular A-matrix such that for any 
latent root A; whose multiplicity is «, the constant matrix D,(A;) has degeneracy « 
(or rank m —a), and also «Xm for all 7. 

This definition implies that there are « linearly independent vectors q; associ- 
ated with A;, and similarly for the r;. It can also be shown that it implies, and 


is implied by, the condition that A; has only linear elementary divisors. The 


former property has more significance in this application. 

It is well known that the latent roots and vectors of D,(A) can be found by 
considering those of an /m X/n matrix whose elements are linear in A. See for 
example FRAZER, DUNCAN & COLLAR [1], and Section 2 below. For such matrices 
well established methods of solution are available. In particular, OSTROWSKI 
has given detailed proofs for the convergence of iterative methods based on 
use of the Rayleigh quotient in a series of papers listed under reference [2]. These 
will be referred to in the following as OsTROWSKI [2,], ..., [2g]. It is our aim 
to generalise the Rayleigh quotient and two of OsTROwskI’s iterative processes 


* My use of the terms latent root and latent vector may not be very orthodox. 
It is not consistent with the usual phraseology in which we define the latent roots 
of a constant matrix A as the zeros of |4—AI|. In this work we speak only of the 
latent roots of matrices dependent on a scalar parameter. Thus we may refer to 
the latent voots of A—AI, while the latent roots of A will have no meaning. 
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in such a way that they can be applied to the 7 xm matrix D, (A) rather than 


a corresponding Jz XJn matrix, and thus obtain a possible reduction in the ' 


computational labour involved. 


Our derivation of the generalised Rayleigh quotient will involve a transforma- 
tion to the equivalent linear form used by the author in [3], and the proof of © 


convergence for simple matrices depends on the spectral form for [D,(A)}* 
obtained there. 

The generalisation of the classical Rayleigh quotient and its properties are 
obtained in Sections 2, 3 and 4. In Sections 5 and 7 we define and discuss the 
iterative process based on this generalisation. In Sections 8 and 9 we establish 
some local convergence properties for the iteration, and in Section 11 we give 
two numerical examples. The closely related process known as “‘broken iteration” 
is discussed in Section 10. 


2. The Rayleigh quotient for matrices linear in A 
When /=1 we have the simple A-matrix 


DAA oe 3) 


and there are two sets of m linearly independent vectors q; and r; obtained 
from (2). In the classical problem of small vibrations of a conservative mechanical 
system about a position of equilibrium A, and A, are real symmetric matrices, 


so that we may assume q;=1r;. This case was considered by CRANDALL [4] and © 


TEMPLE [5], and they both employ the Rayleigh quotient in the form (in our 
notation): 


as q A494 
R@)=—- £34. (4) 
The important properties of this quotient are that when considered as a function 
of the components of g, R has extreme values when q is any latent vector of 
D,(A), say q;, and the corresponding extreme value of R is the latent root A,;. 
(E.g. see GouLp [6].) 

OsTROWSKI obtains the form (4) in his paper [2,], but in paper [2,] he makes 
an important extension to non-symmetric matrices; although this is in the case 
A,=I (the unit matrix) only. In this case the sets of vectors q; and 7; are 
distinct and the first generalisation of Rayleigh’s quotient takes the form 


na 
BA liiagersonant (5) 


OstRowsKI shows that R(q, 1) has a stationary value at q=q;, T=1; 4% 
1=1,2,...,%; and that R(q,, 7, =A,;. 
The obvious generalisation to matrices of the form (3) is 
r A 
R(q,r) =— ae Pe ; (6) 
In the next section we will consider this quotient for a linear J-matrix 
equivalent to D,(A), and this will suggest a form of Rayleigh’s quotient defined 


in terms of D,(A) itself. The properties of R as stated for (5) carry over to our | 


more general form (proved in Sections 3 and 4); and in particular, these properties 
are established for (6) as a special case. 
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However, we note here that the value of R given by (6) when evaluated at 
q=4q; and r=r, follows immediately from the well-known biorthogonality 
property of the latent vectors. When the latent vectors are suitably normalised 


this takes the form: t 1 
r; Ay q; = 9;;; r;4,q;=— 6;;4;.- (7) 


When />1 no such simple orthogonality property exists, simply because 
there are /m latent vectors q; or r; in an n-dimensional vector space (v. Section 3 
of [3]). Therefore any method for linear matrices employing this property 
cannot be immediately generalised to problems in which /> 1. 


3. The generalised Rayleigh quotient 
For any latent vector # of either of equations (2) we denote by a) the product 
A°x, 4 being the corresponding latent root. Then it can easily be verified that 
equation (2) is equivalent to the following /m x/mn partitioned matrix equation 
whose elements are linear functions of A: 


0.0 =e ones rag) Ue ee Gust Se NG 
ACen Fork Oe Oreo. ay pam bE 
Ay A, Ay : teal aa eter tee O 
cae ee yess be allpess 
ilo 0 A, ole ari Sea 
tO. A, Ay... : Soe reins pee BEA eh 1A ENO 
BA ight). auho 1,4 |) uortteno Mae OGuewd 0 O txts 
mapeestand 
q 
gq’ 
Slee: =0 
q® 
q 


Denote the two /m x/m matrices appearing in this equation by -¥ and @ respec- 
tively. It is clear that since Ay is non-singular, so is .7; and that if all the 
| matrices A,, A,,..., A; are symmetric, then so are YW and @. Denote the /m x1 
vector generated by 4 and q by q*, and the corresponding vector generated 
by A and r by r*. 
Rayleigh’s quotient in the form (6) now suggests that we investigate 


ae rv’ € qt 
PAO SA) ee river gar (9) 
For the numerator of (9) we have 
Hag 
(i ACs bit i A ak ap 
q 
=r'{— (t—1) A A,— (L—2) 074A +> — PAL + Ah q (10) 


=r {1-5 Dy(d) + Dil} a. 
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a 


And for the denominator of (9) we have 


rt! Sgt =r Ag+ (i — 4) 2A, ++ 4+2AA;2+A;,334@ 


° (14) 

=n {7 Dj(0)} 4 | 
Combining (9), (10) and (11) we obtain the following form for the Rayleigh 
quotient: Rigen r’ D,(A) q ud 


alia 
| DA] 9 
and we consider this expression for all q, r, and 4, whether latent vectors and 


roots or not. (We assume always that 2” oD, (2)| q=0 for any set of latent 


t 


values. } The form (12) obviously has one of the usual properties of Rayleigh’s | 


quotient. That is, for latent vectors q;, 7; associated with the latent root Ais 


R(Qi. 15.44) = A;- (13) 


4. The stationary property 
We now prove that the function R(q,r, A) defined by (12) is such that R 
has a stationary value at q;,17;, A;, for i=1, 2, ...,1n. We may note in passing 
that the definition of R and the following proof do not depend on the polynomial 


nature of D, (A). Thus, the same results will apply to any suitably defined matrix . 


function of the scalar, A — although they will only be of value if latent roots 
and vectors exist. However, we will proceed with no change in notation. 

In order to establish the stationary property of R we make arbitrary small 
variations in q, r and A from a set of latent values, and show that the resulting 
change in R is zero to the first order. Thus, we write q=q;+6q, r=7,;+ 07, 
A=1,+64, so that 

R(qi.1;,4;) + OR=A;+ 6A (ri +67’) Di(Ai +64) (qi+oq) ; (14) 


, , d 
(ri+6r)| 5, Dil], (ata) 


For the numerator of the quotient on the right we have 


(rit 04") Dy(A,+ 82) (q:+ 64) ~ (v5+ dr") {D, (2) +44] 4 D, (A), | (q.+ 64), 


Ai 


~ bar ar), a 4 
to first order in the variations, using equations (2). For convenience we will 
in future denote Fez (4)|, by Df! (4,). Having already assumed that the 
right-hand side of (15) does not vanish, we may combine this with (44) and (13); 


when it is found that 6R=0 to first order in the variations; this being true 


for an arbitrary variation of q, 7” and 4 about any one of the Jm sets of latent 
values. Hence the following statement: 


The generalisation of Rayleigh’s quotient, R(q, r, A), given by (12) ts such that 


R(q;,1;, 4;) =A, for t=1, 2,...,ln, and R has a stationary value with respect 


to A. and the components of q and ¥ at each of these points, provided that r,D! (A,) q;=K0. 
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5. An iterative process 


We suppose that we are given an a ETE A, to a latent root, A, of D,. 
| We construct a sequence A,; s=0, 1, 2, ..., and two sequences of vectors, €,and y, 
by the following rules: 


Using arbitrary vectors w and 2, we form 


Bs = [D,(4,)]*w, = [D1 (A,)]> &: (16) 
i jand then 
As41=R(E5, 05, 5) 
i (17) 
Jog| ns Di (As) &s x 
. a DP a) Be 
' Iterative processes of this type are discussed by OstrowskI [2,] and by 


| CRANDALL [4] for some special linear A-matrices, and they both demonstrate 
‘that, provided |A)—A| is sufficiently small, the sequence A, tends towards the 
latent root ie i convergence being quadratic. That is, if |A,—A| =A, then 
i [Asia —A| = . We will establish convergence of this order for the iterative 
| process sega a (16) and (17) when applied to simple A-matrices of any degree. 
| Although the vectors w and 2 appearing in (16) are described as arbitrary, 
‘there is one condition to be satisfied by each of them, the reason for which will 
/become clear in Section 7. If the sequence A, converges to the latent root 4, 
‘then there must exist at least one pair of latent vectors q and r associated with 
‘A such that 


Y w==0- and q’2=0. (18) 

| 

In practice there will be zero probability of either of these conditions being 

violated, so the use of the word arbitrary may be excused. 

; An obvious difficulty arises if the matrices A, of (1) are all real and we seek 
a latent root which is complex. If w, z, and A, are all chosen to be real then the 
process breaks down, because the operations involved in (16) and (17) will all 

be real, so that the iterates can never converge to a complex set of latent values. 

In cases of doubt a complex /, should therefore be chosen. 

This iterative process may well be suitable for computation on a high-speed 
digital computer. The main computational difficulty may lie in the inversion 

of the ill-conditioned matrices, D,(A,). 


6. Spectral form for the inverse of a simple A-matrix 


For future reference we will now state a theorem proved by the author in 
[3] which gives an expression for [D,(A)]7* in terms of all the latent roots and 
vectors of equations (2), provided D, is simple, and provided A itself is not a 
latent root. In the following, Q and R denote the ” x/nm matrices whose columns 
are the two sets of J” latent vectors of equations (2) written in some fixed order. 
The corresponding /m xin diagonal matrix of the latent roots we call A. 

If q;,7;, 4; denote a set of latent values, we can rewrite a part of Theorem 2 
of [3] in the following way, better suited to the present work: 


* When n=1 this is simply NEwron’s method for finding the zeros of a poly- 
nomial of degree /. 
22* 
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Theorem 1. For a simple j-matrix D,(A) we have 
[Di A)]*#=O(A—A)?R, (19 
provided A--/;, and provided that the latent vectors are normalised so that 
7, DP (A) a=. (20) 
From our definition we can write Q as the partitioned matrix: 


(Giese Diels 


and similarly for R; while (I A — A) thas diagonal elements (A — 4,)4,..., (A— Ay») +. 
Using these forms on the right of (19) and finding the product of the partitione 
matrices, we see that (19) can also be written as: 


DM = ae (24) 


where the matrices F; are the outer products q;r;. Notice that (19) and (21), 
are in fact the same equation written in two different ways, and either of these 
forms may be used under the conditions of theorem 1. In particular, multiple: 
roots are permitted in the partial fraction expansion (21), provided the degeneracy, 
condition in our definition of a simple A-matrix is satisfied (cf. FRAZER, DUNCAN 
& CoLrar [7)). 

7. The approximate latent vectors 


If the sequence 4, tends towards a latent root, 2, which is a simple zero of! 
|D,(A)|, then the sequences of vectors &, and y, defined by (16) tend towards: 
the (unnormalised) latent vectors q and r. This is apparent if we write [D,(A,)]7' 
in the form (21). We have 


In 
te Fw 
= [D(A tw= > si. (22) § 
Jl : 
Now F,w=(q,r;) w=q,(r;w) =q; 0; say, where 
w;=7; Ww. (23) | 


If we denote the latent root to which the 4, converge by A, the corresponding ; 
vectors by q and r, and put r’w=a, then we may combine (22) and (23) to give: 


QO” 7 | oe oy ‘ 
Soyer reer Ling, ig te (24) 


where >)’ implies a summation in which the approximated latent root is omitted. 
Thus provided A, 1s sufficiently close to 4 and the first of the inequalities (18) 
is satisfied, the dominant part of &, is proportional to q. 


Similarly, we obtain (for a simple zero, A) 


where | 
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If the latent root has multiplicity «, and 1<a<n, then the « latent vectors, 
q, Say, corresponding to A are linearly independent (for a simple jA-matrix). 
In this case the vector mq appearing in (24) must be replaced by >) w,q,, and &, 
will tend towards a linear combination: of the « vectors q,. By taking « inde- 
pendent vectors w, the various limits correspond to « linearly independent 
‘combinations of the q,, and the sub-space generated by the latent vectors of 
A is then completely determined. The vectors themselves are not determined 
jto within an arbitrary orthogonal transformation, after being normalised accord- 
‘ing to (20). 

In order to verify this last statement, we note first that any linear combi- 
nation of the q,’s (or of the r,’s) is a latent vector with latent root 2. Next we 
note that the sets of latent vectors q, and r, (associated with A), after being 
mormalised according to (20), satisfy the following condition: 


1, DY” (A) Wy = Ouys 


‘where we have 1Su, ya. (This is easily deduced from equation (20) of [3].) 
If we now denote by Q, and R, the m xa matrices whose columns are the vectors 
q, and Yr, respectively, then the above condition can be written in the form 


Dy (A) Qa = Tu: 


! Let G be an arbitrary « x« orthogonal matrix, and replace Q, and R,, in the 
‘previous equation by Q,G and R,G; then 


(R,, G)' DP (A) (Q,, G) = G' Ry DP (A) Q,.G 
AG ING 
Sit: 


_ Thus the component vectors of Q,G and R,G satisfy the same conditions 
‘as those of Q, and R,. That is, these vectors are not defined within an arbitrary 
orthogonal transformation. 


Another argument justifying the use of (16) has been given by OsTRowskI [8]. 


| 8. Convergence 
In this section we establish a “‘local’’ result concerning the convergence of 
‘the iterative scheme described in Section 5 — when applied to simple A-matrices. 


First we prove the following lemmas. 


Lemma 1. Under the conditions of theorem 1 we have 
£D,(A) = D,(A) Q(L4 — A)? RD, (A). (27) 


For any square matrix U of order whose elements are differentiable functions 
of a scalar, A, and for which U™ exists, we have 


OAR) = 1, 
so that 
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and d Ba! day 
Seti U(A) |, U>(A)| UIA). 
Putting U=D, and making use of (19) gives the result at once. 


Lemma 2. Under the conditions of theorem 1 the sequence A,, S=O, 1, 2,..., 
defined by (16) and (17) 1s given by | 


_ & OAL A,—A)-2 Rw : 
Assa z’O([A,—A)2R’ w * (28), 


Using (16) we obtain for the numerator of the quotient in (17): 


ns Di(A.) = 2’ [Di(4,)} 7, 


=2' QO(1A,—A)*R’w, (29) 


using (19). For the denominator of the quotient in (17) we obtain from (16) 
and (27): | 
n, Df? (A,) &, =2' Q(L A, — A)? Rw. 


This can be combined with (29) to give 
Ans Di» (A,) e ei Ns D, (A,) a =e QAI A, 5 PANS ak 
and substitution in (17) now gives the desired result. 


Theorem 2. I the conditions of theorem 1 apply and the sequence i, converges * 
to a latent root, then the convergence 1s at least quadratic. 


If the sequence A, converges to the latent root 4, then using lemma 2 we: 
may write 


_ 2 Q(A—AL) (LA,—A)?2 RR’ w | 
Us ge 2’ Q (I 4;— A)? R’ w : (30) | 


From (23) and (26) we see that the / x1 vectors R’w and Q’z are conveniently 
denoted by w and ©, these vectors having elements ;, ¢; respectively, for 
GND, onan be Wiss 


ln 
A;—4 
ie Ope ee 
df (A,— Ai)? 
3 Cj 0; 
(AA)? 


since the square matrices in the first expression are all diagonal matrices. In 
the final expression on the right of (31) amy A; for which all the corresponding ; 
products ¢;,; vanish does not appear in either summation. Denote the distinct! 


A, for which at least one product ¢, w; is non-zero by 01,09, ...,0,,-. Then (34) ' 
can be written: 


4 8 (AAD) (14d) 
CTA Arte 


Asay 


(31) | 


m es | 
2? Tai? 


= 
<i Pi 
De (A; —0;)? 


, 
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where the #; are non-zero constants. If we denote by # the value of p; corre- 
sponding to the latent root A (p=£0 by virtue of (18)), we have 


at Ge 
woe fon Dir, — 
i ee oat 
p+ ae 


' where >)’ now implies a summation from 1 to m in which the approximated 
» latent root is omitted. Thus, if 2,—4A, we find that 


| 
| As ipa 
aa ee (33) 


_ and hence the convergence will in general be quadratic. 


| 
| Asay A= 


= (A, — 4)? 


9. The convergence neighbourhood 


We now generalise the treatment of OSTROWSKI [2,] to establish a convergence 
neighbourhood for the above process. That is, to define limits within which an 
approximation A, (to 4) must lie in order that |2,.,— , and the 
| successive approximations to 4 are such that the sequence |A,—A| is strictly 
, decreasing to zero as s increases. 


Let d=Min|o;—A 


and suppose that N (= 2) is a constant such that 


1 
|A, —A| a5, and hence |A,—4,| 2 a(t ~>}- 


Then 
o;—A 1 Ne (As— A)? 2 1 
| (2,4) | =a (woe om | (2,—9;)2 | = (N=1)8 * Oe 
Using the inequality |a+ | = ||a|—|0|| for any complex a and 8, we obtain 
from (32); 


Pe ale aan Gan 
A,;— i)? 
and we assume that the denominator does not vanish. 


Pat P= >;|p;| anda= 
= 


, so that >)’ |;| =P —a. Then using the inequali- 


1 _N2(P—a)/(N—-1)? 
(P—myj(N—1) G5) 


ties (34) we get phe 
(As— A)? 


a sufficient condition for the manipulation of the denominator being that 
a >(P—2x)/(N —1)?, 1.2. 
1 
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Under this condition (35) can be written 


Fav eee) N(P—a)_ am 
ea = 7 (N=2N12)a—P J4s— A], (37) 
and we deduce that: 
If for some s we have 


day: ( {N?=—2N +2) 4 P 
|, a| <4, Min{ NP x) tp, (38) 


and (36) is satisfied, then from this index onwards the distances |A, —A| are strictly 
diminishing and converge to zero. 

The strength of this result will depend on 7. For /=1 and all the latent roots 
real and positive OstrowskI [2,] uses N=2 to obtain a comparatively large 
convergence neighbourhood. If we allow complex roots then (38) gives a con- 
vergence neighbourhood of radius d/2 provided 2 >3P/4. In the case examined 
by Ostrowski the same convergence neighbourhood is obtained if a>2P/3; 
a slightly stronger result. 


Notice that in the case /=1 it is always possible that =P. That is, that 


p;=0 for ~;+p. This is achieved if, for each latent root A; A, either o;=0 
or €;=0 (ve: equations (23) and (26)). 
However, if />1 there is an important class of matrices for which we have 


the strict inequality 7< P; namely those for which all sets of right latent vectors | 
taken at a time are linearly independent and similarly for the left latent — 
vectors. The proof of this is easy, but we will not give it here. From the assump- — 


tion (18) we observe that #=-0 so that for this class of A-matrices there must 
be at least one other coefficient, £;, which is non-zero, and for large / there may 
be a lot more than one. Thus for />1 the condition 7 >3P/4 obtained from 
(38) by taking N=2 may be quite a stringent one and, in theory at least, a 
smaller convergence neighbourhood may be more reasonable. For instance, by 
taking N=4 we obtain a convergence neighbourhood of radius d/4 provided 
> 5 P/14= (0-36) P. 

Hence it appears that, for given starting vectors w and 2, the accuracy of 
the initial guess A) will have to increase with J, the degree of the A-matrix; while 
w and 2 should be chosen to make a (=|/| =|q’2-1r’2w| for a simple root) 
approach P (=) |#,|) as nearly as possible. 


10. Broken iteration 


A second iterative process known as broken iteration is considered by 


OsTROWSKI [2,] and by CranpaLt [4], and suggests the following modified 
process: 


For any i, define 
E = [D, (Aj ee, Ub = [D; CD ees ee (39) 


for S=0,1,2,..., where &, and n_, are arbitrary starting vectors w and z satisfy- 
ing (18); and put 


As41 =RE,,. 0, hae (40) 


—— — 
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This scheme should be compared with that of Section 5. The arbitrary 


vectors w and @ used at each step of the former scheme are now replaced by the 


successive approximations §,_,, y,_, to the latent vectors q and r. When /=1 
it can be shown that if the above scheme is slightly modified to take advantage 


of the orthogonality properties of the latent vectors (cf. CRANDALL [4]), then 
_ cubic convergence may be achieved. However, it appears that if />1 this rate 


of convergence cannot be achieved, at least for the class of matrices defined 
at the end of the previous section (7.e. those for which 7< P). Using broken 
iteration may increase the rate of convergence, but probably not by another 
order of magnitude. 


11. Numerical examples 


a) The oscillations of a wing in an airstream 


This example is taken from the work of FRAZER, DuNcAN & COLLAR [9]. 
With one or two obvious modifications to the numerical values the problem 
reduces to finding the latent roots and vectors of D, (A), where 


17-6° 1:28)" 2:89 7-66 2:45 2:40 
A, =| 1-28 0-824 0-413], A,=]0-230 1-04 0-223], 
2:89 0-413 0-725 0-600 0:756 0-658 
121. 18-9 15-9 
A,=| 0 2-70 0-145]. 
11:9 3:64 15-5 


From experience of similar problems a person experienced in aeroelastic work 
may take as first approximations to the largest latent root and associated right 
latent vector: 

A, = 72 10:0) eeand tb — 0 O40: 


For convenience we take 2=w in equations (16). After inverting the matrix 
D,(Ao) we find from (16) that 


— 0:-1481 + 7(0-00289) — 0-1476 — 1(0-00528) 
E) =| — 0:2764 +2(0-01700)|, =| — 0.2716 + 2(0-02126) |, 
1-0 1-0 


where, to facilitate comparisons, we normalise the vectors so that the largest 
element is one. Substituting 2), &) and yp» in (17) (and using the fact that 
n,D,(A,) §,=w’&,) it is found that 


Vi 0756257 18-535). 


This completes one cycle of the iteration. In this case the choice of w is 
not decisive in determining the question of convergence. It is found that with 
the above choice of A, and w’=[1-0 0 0] or w’=[0 1-0 0] the process would 
still converge, and hence would do so for an arbitrary choice of w. 
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In the next cycle of operations it is found that 


— 0:14938 + 1 (0-004 416) — 0-146 63 — (0-005 382) 
€ =| —0-27756 —1(0-020362)|, m=} — 0:26753 — (0-023 272) 
1-0 1-0 


and then 
Ay = — 0-8834 + 1(8-4411). 


To this accuracy the latent root is in fact 
— 0-8848 + 1(8-4415). 


The value given in [9] is —0-885 +7(8-443), so the fourth significant figure of 
the calculations quoted there is suspect. Bearing this in mind, the corresponding 
right latent vector as given in [9] is (after renormalisation) 


— 01494 +7 (0-00466) 
— 02778 — i (0-02066) | . 
1-0 


We observe that in this case results which are sufficiently accurate for most 
engineering purposes are obtained after only two steps of the iteration. A further 
important feature of the method is the possibility of finding latent roots in any 
order. This is in contrast to the iterative (power) method used in [9] which 
first gives the dominant root, and the accuracy with which sub-dominant roots 
are calculated depends on the accuracy of those already found. 

However, the example chosen above may not be typical of problems met in 
practice, as the distribution of the latent roots is particularly favourable. The 
complete set of latent roots is: 


— 0-885 +4(8-442), 0-097-+7(2-522), —0-920-+7(1-760). 


b) Perturbation problems 


We consider here a rather special perturbation problem. A mechanical system 
of 2 degrees of freedom oscillating about a position of equilibrium in the presence 
of small viscous damping forces gives rise to the following equations: 


Ap+eBp+Cp=o0, 


where A, B and C are n Xn real symmetric matrices of constants, ¢ is a scalar 
which is small compared to the elements of A, B and C, and p is the column 
vector of generalised coordinates. Thus the term ¢ By represents a perturbation 
from the conservative system obtained when ¢=0. On looking for a solution 
in the form p=qe?', we obtain 


(422+ eBA+C)q=0. (41) 


We take as our first approximation to a solution of (41) a solution obtained 
when ¢=0, say dy, Qo. It is well known that @) may be normalised so that 


QMAGQ=1, and qCq=—2 (42) 


A generalised Rayleigh quotient iteration for lambda-matrices a2) 


(cf. equations (7)). In order to improve on the approximation 4) we use (17) 
with w=z2=q, and find 
=f, as Go (A A+eBAyt+C) q 
G(2A A +eB) gq © 


Using (42) this We a to 


= eB ry 
Ay Ao Age’ 


where B=qoBqy. If we assume that the exact latent vector q differs from q 
by terms of order ¢, then we have 


A=, — teh +0(e2 


or, to the order of e¢ indicated, 


Ay =hy— ZEB. (43) 


This first correction was obtained by Lord Rayleicu [10], and has been 
discussed further by the author [//]. As a numerical example we consider that 
of Section 5 of [11]. The matrices appearing in (41) (after transformation to 
suitable principal coordinates for the undamped system) are: 


0-602856 0 — 0-255274 1 @. © 
Ave.) © aes 0-966840 — 0-216403], C=] 0 1-0 0] ; 
5-761640 0 oO 2:0 


and we take e«=0.-01. 
In (16) we have w=z%=@p, so that the sequences &, and yn, coincide. For 
Ay we take the isolated root of the undamped system, 7.e. Ay =7|/2. Clearly then, 
Qo is the column vector with elements 0, 0,1. We alter the order of operations 
n (16) and (17) and write 
&1 = [De (Ao) ] * Go = [4 Ao + € Bay + C)*40- 


Now from (23) we have in this case w;=@i@o, and since q, differs from q by 
terms of order ¢, the orthogonality property shows that w;=O(e). From (24) 
we now see that after being normalised, §£;=q-+0O(e?). Calculation gives 


0-000031 — 7 (0-003 610) 
€, =| 0-000042 — 7 (0-003 060) | , 
1-0 
and this vector has been normalised to make its largest element one. (This 


manipulation will obviously not affect (17).) To the same order of accuracy 
we get immediately from (43): 


A, = 1/2 — (0-005) (5-761 640) 
= — 0-0288082 + 7 (1-4142136). 
Then the second application of (17) gives 


ae R(&., Ay) 
= — 0-0288 086 + 7(1-4139122), 
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and the error should be O(e4). In fact the exact root and vector are: 
— 0-000189 — 7 (0-003 601) 


4 = — 0-0288084 + 7(1-4139127), q@=|—0-000145 — 7(0-003 054) 
1-0 


The method used in this example may obviously be used to investigate the 
effect of a small perturbation on any known latent root and vector of a lambda 
matrix — the first order perturbation of the latent root being particularly easy 


to obtain. 


The author is grateful to the Society for Industrial and Applied Mathematics 
for the opportunity of presenting this work at the Symposium on Matrix Computations 
held at Gatlinburg, Tennessee in April, 1961. 
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Elementary Function Representations 


of Schlimilch Series 


VICTOR TWERSKY 


Communicated by A. ERDELYI 


1. Introduction 


Alternative representations, as series of more elementary functions, are 
obtained for the Schlémilch series! 


(1) Z,,,(s D) cos(sD sin >), 
1 


(2) > Zen41($D) sin(sDsin 9), 
Si 


where Z,, is any solution of Bessel’s equation for integral m, and where D>0, 
0<sin J)<1. The series converge provided D (1 +sin %)/2% and D(1—sin®%)/22 
do not equal integers. 

These series are of interest in a variety of physical problems. In particular 
the special case of (1) with %)=0, which was considered by IcNatowsky [2], 
arises in the analysis of the scattering of a plane wave normally incident on a 
diffraction grating [3] (¢.e., on a planar periodic array of parallel cylinders). 
For such problems, we have D=kd=2zad/A, where d is the spacing between 
cylinders and 4 is the wavelength. The more general series (1) and (2) arise for 
arbitrary angle of incidence on the grating [4], for related waveguide problems, 
etc. Various special cases of (1) and (2) arise in other scattering problems, in 
characteristic value problems in the theory of vibrations, and in communications 
engineering. 

The series we are concerned with may be collected in the form 


(3) A ee a Ac eh, 
Sa 


where H,, =H‘ = J, +7N, represents the usual Hankel function of the first kind, 
and where J, and N, are the Bessel and Neumann functions. In general?, the 
representation (3) is too slowly convergent either for numerical computations 
or for the determination of the explicit dependence of 4%, on the parameters 
n, D, and %,. The purpose of this paper is to derive an alternative representation 
in terms of elementary functions. The case m=0 is given by MAGNus & OBER- 
HETTINGER [6] (pp. 59, 60) and will not be evaluated. 


1 See Watson [1], Ch. XIX, for a general discussion of such series. 
2 For some purposes adequate results may be obtained from a representation 
of (3) based on the asymptotic form of H, for D>n, D>1. See [5]. 
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In order to obtain a more useful form, we consider initially 


jig (R) = ¥. ef9P8inO H, (R,) i" ei" % — H, (Ry) ei", 


we) 


4) R= X*4(Y—sD), Ry=R=(X?4+ Y%, p, = tan ( $ 


, and y,> +42 for s $0; consequently we may 


As R->0, we see that R,>|sD 
express (3) as 
(5 H, = Vim h, (R). 


In the above, we have essentially added and subtracted the term containing 

H,,(R), there being no singular terms in the limit. This step was introduced by 

Icnatowsky [2] for 9,=0; aside from this, our procedure is quite different 

from IcNaTowsky’s and obviates his detailed algebraic manipulations: Instead 

of using series expansions of H,,(R) for R-+0, we recast h, as the difference of 

a sum minus the analogous integral, 7.e., (>; —Jfdu) f(u). The limit of this 
7) 


result is then split into two parts, one which is rapidly convergent, and one 
which by the use of Euler’s summation formula can be written as a finite sum. 


It will be seen that the parameter Y is unessential in the above and could 
be set equal to zero from the start; however, its inclusion provides a slight 
generalization of interest to related physical problems. For such problems we 
have X=kx and Y=ky, where x and y are the usual Cartesian coordinates. 


2. Derivation of an Operational Form 


Using the Sommerfeld integral representation of the Hankel function [6, p. 41], 
and exploiting the full range available for the limits of the path in the complex 
domain, we write 


4%—100 


(6) He (R,) 4” et Ps as he ‘3 Eh RESET 
4 
—t2+100 


which holds for all |p,|<432, i.e. X>0. (Note that the limits are the same 
for all s in spite of the fact that the , differ.) Equivalently, in terms of f=sinr, 


we have 
lee) 


(7) H,,(R,) i% ei" % = ae | ccdlieeBies Pa e 
4 yi-# 


tnsint 


where /1—2=1 /®—1 if |] >1. (The case n=0 is given in [6, p. 40].) 
We now substitute (7) into (4), and use Porsson’s summation formula in 


1 = iS a r is ~ H 
the form SDN | i) er dt Ny {(2u%-+ a) as in [6, p. 247] (or, equi- 


u=—0Co 


valently, identify the sum over s as the “periodic 6-function’’). Thus, in terms of 


sin}, = sind) + 2un/D, PSPs ee, 
Cy=1/D V1 —sin?d, if sin?O,<1, C,=1/iDsin?d,—1 if sin2d,>1, 


i (9) >, eiSDsindy 
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we obtain 
(R,) 4” ent nes a) s Gi etn ou+t fe coer Xe 0. 
s=—co L=—-CO 
Similarly for X <0, we replace g, by x — g, in (6), and obtain (9) with 0, replaced 
By 7 —?,,. 
Alternative derivations of (9) are given by REICHE [7] and Ienatowsky [2, 3]. 


_ (The present derivation is briefer, and provides results and notation we require 


subsequently.) The special case 7=0 is also a special case of an expression on 
p. 62 of MAGNus & OBERHETTINGER [6] (and the result for »++0 may also be 
obtained, after some manipulation, from their related expression). Related series 


are considered by ScumiptT et al. [8], who also give additional references. 


The series in (9) converge provided that 


D(1 +sin 0)/22 = m, + 0,1, 2,.... 


| Were m, or m_ an integer, then the corresponding value of C,, in the right-hand 


side of (9) would be infinite. Similarly, since H,,(kR,) ~i~” lone 0 leah 


z|s|D 
the left side of (9) would also diverge for integral m, or m_. (For non-integral 
values, convergence considerations are obviated by using R,(1+7e), e>+0.) 


Using the terminology of grating theory [4], we may distinguish two kinds 


of terms, or “‘modes’’, in the sum over w. (All the angles are measured from 
_ the grating’s normal, 7.e., 


Re#,|< $2.) The “propagating modes” correspond 
to | sin a ee cos o.= /1—sin? #,; for these, we have —u_SwSm, where uw, 
and w_ are the closest integers to m, and m_ for which | sin ial nce © On pay i 
[m,]=[D(1—sin 8,)/22], and wu =[m_]=[D(1+sin %)/22]. The remaining 


| values u>u,+1>m,, and uw<—u_—1<— m_are “evanescent modes” for which 


fein 3,,|>1, cos ,,—=7 |sin2 #,,—1; for these we may use cosh 7, =sinBy+ 2u 2/D, 


where 3,=32—1|n,| for w>u.+1, and 6,=—$%+7|n,| for w< —p_—1. 


(The excluded values sin®,—-+1, which correspond to integral values of 
m., =D (1 sin Jo)/2a, are known as the “grazing modes”’ or “Rayleigh values”’.) 

Returning to (4), we use the integral representation as in (7) for the isolated 
term H,,(R,) 1” e-'"™. For convenience, we write 


(10) jab (Ro) 4” ew Po =o, gain Outi Y sin du+1X cos ou du, eS 0, 
where (in order to obtain the same kernel as in (9)) we changed the variable 
of integration to w such that —¢=sin #+22u/D=sin 9,. 


Thus, for X>0, we obtain 


co foe) : 
(11) h, (R) =e 2 ( > ——_ F dy) oF etn Ourt eee i ae CUS Cis 
u=—CO —CO 
where yu is an integer for the sum operation and a continuous variable for the 
integral; similarly for X <0, we replace 3, by 7 —#,,. (Using grating terminology 
[4], we may say that (11) indicates a sum over the discrete plane wave modes 
of the grating minus the integral over the analogous range of continuous modes.) 
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Substituting (11) in (5), and letting Y=0, X=e—-+0, we write 


le,2) 


(12) #,=Lim2( > LS fig gee Cle bene 0 


e—>0 pw=—0CO —O0O 


Equivalently, if we first replace X by —X and #, by 7—#, in (41) and then 


use (5), we obtain 


(13) ONO laa 1 
The arithmetical mean of (12) and (13), 7.2., 

(14) 8, =SC,fenintu 4. ein m— Oud), 
satisfies 

(15) Hi, =(— 1)!" Hn, 


as obtained directly from (3) by using H,=(—1)"H_,. Because of (15), we 


consider explicitly only positive values of n. 


In performing the operation §/(u) we split it into parts which are explicitly : 


L 


convergent and parts which require the use of the exponential “‘convergence 


factor” e*°°s*, where cos #,,~7|u| 22/D as |u|—>oo. For the first, we evaluate — 
the resulting integrals and leave the sums unaltered. For the second, we arrange — 
matters so that the operation is carried out essentially on a polynomial, and 

then use EULER’S summation formula to obtain a finite sum. For convenience, ~ 


we apply EuLrEr’s formula in the form given by NORLUND [9]: 


lee) 


(16) Lim { f 2" 1 etide— Dealt hen od BA (xn, 


é—>0 s=0 


where B,,(x) is the Bernoulli polynomial. Specifically, we shall use the equi- | 


valent form 


S.(u + Asin 9)"~ ae Lita) Sie f du) ( (u + Asin 9,)"-2 eos ou 


(17) é>0 \y=0 —Asind, 
=— B, (Asin @,)/n, 
where w= — Asin 0) =—(D/2x)sin 9 corresponds to 9,=0. Since cos #,= 


[1— (w+ Asin )?/A?}! ~i(u+Asin %)/A, the convergence factors in (16) and 


(17) are equivalent for the purpose at hand. 


3. Elementary Function Representations 
We rewrite (13) as 


(1 8) KH, =) Sa Gs etn (7—9u) Be Ss. oe ewin (2—9y) 
where 
—1 —Asin®, 
Se f(u) —= Lim ( ey = if dy i (uw) ee 
(19) Boe we 


[o.8) 


S. f(y) =Lim(S — f du) (we, A = Djam, 


0 —Asind, 
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| Adding and subtracting e~'"* in the kernel of §, and introducing 


(20) jo =2 SS c fe—*# (7—8n) cake iad ea ; 
we write 
(21) Hao Se ee e~in(n—Ou) 1 9 S. Ge en in Ou +F,. 
In the range of $_ for which sin #,<—1 we have 0,,=—}$2+%|7,|; similarly 


| in the range of S, for which sin #,>1 we have },=32—i|n,|. Consequently 
| the first two terms on the right-hand side of (20) converge independently of the 
| implicit convergence factor. Thus letting e=0 in the first two terms, we evaluate 
the integrals, ¢.g., 


| —Asin dy 0 
| aie mtdu=" fe ei” di — ; 
tn 1 

—42+100 


» and leave the sums as they stand to obtain 


22). #, aetna la WG, en eu _ qe3|+ 2 ie eu + F,. 


tn 


To evaluate F,, we express the kernel of (20) as a polynomial in sin #,= 
| (u+Asin #)/M, and then apply EvuLEr’s summation formula as exhibited in 
)} (47). Thus for 2 even, we use 


(23) sin 2n 0 Eos (—1)"-1 22-1 (m 4m—1)! sing => [1, m] sin?" 9 


cos } ne (2m—1)!(n—my)! 


_ and the form sin 8, = (u+Asin 9>)/A to rewrite (20) as 


m=1 


| Se sinand DMS ap 
| (24) i= 5 Ss ae Nie s ee ie S. («+ Asin J)?" 
lu 


Applying (17), we obtain 


Tiere 2i wo [n,m] Bom 
ene ee, A2™-1 9m 
m=1 


(25) 


Similarly for n odd, we use 


eee =e ee eee sin?) = Y fn, m} sin?” 9 


cos & lire (2m)! (n—m)! — 
to write (20) as 
~ 2n+1) a 
(27) Fanw= x S. ae = Seu ee S. (u + Asin ®,)? 


Applying (17), we obtain 


2 . {n, m} Bom+1 
aA 25 At (2m+1) ° 


(28) Ii, 2 Naa 


Thus (22) together with (25) and (28) are the desired elementary function 
representations of (3) for n>0. (If m<0, we use (15) and the above.) More 
Arch. Rational Mech. Anal., Vol. 8 23 
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explicitly 
H,, = 2 >" Hy, (sD) cos (sD sin Jy) = 2 § C, cos 2nB, 
s=1 


0 =e 
(29) Bi sears tits 
0 
i ae (n+m—1)! By,(A sin 3) 
eo a a m)! (n—m)! A2™ : 


Koni = — 21 >’ Hoy+1(sD) sin(sD sin J) = — 21$ C, sin(2m + 1) #, 


s=1 


(30) Gy, cea ee -25'¢, gene) Pi te 
0 


(—1)" 2°" (n+m)! Bom+1(A sin 39) 
ee 
where H,, =H) is the Hankel function of the first kind as defined by Watson [/], 
B,, is the Bernoulli polynomial as defined by NORLUND [9] and ERDELYI et al. — 
[10], and C,,, B, (in terms of J=D/2z) are defined in (8) and discussed in sub- ~ 
sequent paragraphs. 
4. Special Cases and Applications 
Series for 85=0. If #)=0, then the “Hankel series’’ in (30) vanishes. Equi- 
valently, since for this case sin ®,,=y/d = —sin #_,, and since B,(0)=—4 and | 
B,,-1(0)=0 for n>0 [10, p. 38], we see that the final form in (30) reduces to 
2C)+2B,/x1A=2/D+4B,/D=0. Consequently, for this case we obtain 


Hy, = 2 >' H., (sD) => a om ei en Oy 
(31) —t 


tage! = 1)fegem | ee 
a (2m)! (m—m)! A2™ : 


where »>0. Writing 
(32) Mag 2 Jaa) +422 N,,(sD) = Poet tae 
s=1 Sa 


we separate (31) into the corresponding real “Bessel series’? and “Neumann 
Series! = 


62 bean tears cos 2 b, 
(33) P2n ane D a Di cos}, ” 
Woaans N, 4 ~ (—1)™ ae m (0) 
2n ~ 2n (SD) = BS a De (2m)! (n—my)! haem 
(34) ‘i i 


4 . sin 2n 0, 4(—1)" <4 e-20mm 

shee: cos &, D i sinh ,, ’ 

where sin?,=27 u/D = =p/A=coshn,, b.<A<p,+4. Since e~?*"H/sinh 7 == 
i lh 2 2n \ P 

[a Syn -1] (4) aie , and since the Bernoulli numbers B,,,(0) equal 


— (—1)”B, in Ienatowsxy’s [2] notation, the present series equal those he 
derived originally [2]. 
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More generally, we may write 


(35) He, = bert, 
| where 
| (36) tS: = »3 13 (sD) oe es 4)" 4+ me =$ CF ene ae eg eat Ne 
c= 


(37) i = > N, (sD) fesesiads (— AE gts P sind) we = Se Ale Cm ed Ee am ha (1 
1 é 


Here the subscripts and e designate that the operation is restricted to the 
propagating range (|sin #,|<1) and evanescent range respectively. Thus, for 
all , these ‘Bessel series” and ‘(Neumann series” are associated with the pro- 


-pagating and evanescent ranges respectively. 


| Bessel series. Elementary function representations of the Bessel series are 
obtained by inspection of (29) and (30) or by direct application of 


w-(B—feina(— FJ 


= : 2 <4 cos 2n%, 
i 4 Fan=? 2 Fan(s) cos (sD sin Jy) = pa ae One, 
3 s= _ 


See ee Oe or, 


Thus 


Oops Ora 0) if nm >O0, 


| co ee ee on 
ea 21 Jaa 6D) sins sind ey eT 
. fie 


a cos , 
(where we included the case =O for completeness). These finite-sum repre- 
_sentations for Y, (whose derivation does not involve the limiting process indicated 


in (5) and (21)) may of course also be obtained by various more or less elementary 
procedures. 


Neumann series. Perhaps the simplest representations for V,, are those left 
‘in the form 4%, — Y, with the #’s as in (29) and (30) and the Y’s as in (38) and 
(39). (In any event, it is somewhat simpler to obtain -¥,, by first deriving %, as 
we have done than to apply S, as in (37).) However, it is convenient for com- 
putations to have these components explicitly. Thus 


Noon = 2 »s N,,, ($s D) cos (sD sin J) 


Su 
n 
4 (Sayre SS ee 
(40) Bt: mie Gee) tea 

Tt at 

2 oe >) sin 2n By, EW bi een ! i a 

see oe as = : = 

D . cos #, D res sinh 77% a sinh 47 


where 
cosh yt = + sin#% + y/d = +sin% + 20u/D. 


Pag. 
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Similarly 
Nani ae 27 >), +1 (8D) sin (sD sin 9) 
s=1 
—2i S (—1)™ 2°" (n+m)! Bom(Asin do) 
(41) == ie (2m+1)!(n—m)! Amt 
ae 


ri + cos (2n-+1) Oy ab 1)” | Ss eo (2nt1) np +S e7 (20+) “| 
ee F(> ~ 2 cos 8, pete. ack sinh 74 coe sinh nz 
It is Se convenient to suppress the =0 term of (40) or (41) by replacing ; 
B,, by Byes ’ (Asin 3)"~7 in the sum over m. 


For retirees we include 
Ue. 
Hy=— 20g 22 + (D+ ae 


(42) oo 
1 1 si! 
mas St = ine 2 | | 
sinh 44 27 Dsinhyz 274 


as in [6, p. 60] or as obtained by the Aes vernal and EULER’s | 


(> — fay) + =logy, y=1.781.... (For this case we set e=O in (37), carry’ 
al 1 


out the operations for a finite upper limit M, and then let M—o.) 

Related series. The above results may be applied directly to obtain elementary 
function forms for related Schlomilch series. Thus using the recursive relations : 
for the cylindrical functions, we have 


(43) AQn+1) $) Hants(D) 695 5 D sin By) = Han + Hanen 


s=1 
Shes : ' d 
(44) 2¥ Hanya (sD) cos (sD sin 9) =H, — Hansa, Hya(x) = 4-H, (x), 
Seah 


etc. Additional results follow from the trigonometric identities and by differ- 
entiation with respect to sin J. 

Expansions for D(1-Esin 0)/2a7<1. If there is only one propagating mode, 
2.€., D(1+sin )/2a%<1 (which is of physical interest in the scattering of wave- 
lengths larger than the grating spacing, and in related waveguide problems), 
then the Bessel function series reduce to 

2cos 2” H 21 sin(2n+1) & 
(45) Pan= D cos 8 Sno Fanta = Doe. ifs . 
In this range, the leading Neumann function series up to terms of the order D2 are 


given by Di~ pe a 
Nip loge aoe 2 5 
0 pond pee sls +- sin 8) 1.202, 
My we 2B Pe _ Fae Pe 4.202, 
42 
4% : D2 
(40) ee bs a (L— 28in® Dy) + 24 1.202, 
ze iene ; yeas 
ce Gwe : a asin Do (1 rae sin? dq) , 
= 25 3 16 1 , ‘ 
Ny an re DF (= = sin 9p) + 5 (1 — 8sin? #, + 8sin* 9). 
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fo, 0} 
| Here we have used the fact that >) uw? 1.202. The next terms in the above 
1 


are of order D4. (Note that Y? and Y are proportional to D+ and consequently 
tend to infinity more rapidly than 4% and 1% as D0.) 


The leading terms of VW for large m are given by 


| es ty ae eae) 
/ p) Nan an(Dj2ne" —’ 
aD Dagy 


{ 
| (48) Non iS) (dls ne a Ao Bon (0) 
| 


IGNATOWSKyY [2] showed for the case #)=0 that the leading term of 3, 
for D0 could also be obtained from the original Neumann function series 
by heuristically using the leading term of N,,(¢) for 1 >>?, 7.e., by using N, (é) x 


= aa Similarly for arbitrary #), we find for D0, 

! mt 

Bas meet ec es atom one 

| Neo 2 Man(sD) cos (sD sin 9) ~ En 2 a0 (47), 

; 08 ie eee 

| Nays = — 24D) Nan a(S) sin (sD sin 8) ~ Be eet 


s=1 s=1 


ll 


where the final forms represent the right-hand sides of (47) and (48). 


i 


Near grazing mode. Another range of physical interest corresponds to a 
| “discrete mode near grazing”, 7.e., to #,,, (the “last” propagating mode) or #,,, +4 
| (the “‘first’’ evanescent mode) approaching 3a. If a propagating mode is near 
grazing, then (22) yields 
: 


o 


M+ 


(49) ee =n —6&, Oe 15 
where G,, is finite in the limit. On the other hand, if an evanescent mode is near 
_ grazing, then 
| ne 2i-" 
(50) or a ey a 


nie SE 
apd = we OLE. 


Similarly if 3, -— 3a, we obtain 2" instead of 7~”. If both #,, and #,_ are 
near grazing, then we replace 7~” in the above by 2cos(nz/2); thus for 7 even, 
the isolated terms involve 2(—1)” instead of 7~”, and for m odd, they vanish. 
The factor of 7 which distinguishes the second case from the first is significant 
in connection with the “Wood anomalies’, or ‘“‘grating resonances” [4]; it is 
related to maxima of the scattered field occurring for an evanescent mode near 
grazing. 
Asymptotic form for D>>n. If D>>n, then on introducing the usual asymptotic 
form of %, directly in (3) we obtain 


Ky, ~ 1 "(H(— 1) + #.], 
(51) BY, = ee is) ei SP (L£sin dy) 
Va racr cat ys y 


+ 
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where 1/J/i=e7'#*; the next terms of #, involve (s D)—3. Using Euler's” 


summation formula, we write 


#,~ VF [1- V2 Fe) + an "(2 + ae — 78) 


6, =D(i +sin%) — 27,7 =21(m,— Vz), 


(52) 


where y, and »_ are integers, and where F is the Fresnel integral’. (See [4] for 
detailed treatment of (51) for a finite upper limit.) Thus if 6, is not small, then 
His of order D~!. In particular, if 6, is large, then using F(x) ~if2+e*//20% 
shows that the first term of (52) which is then of order D~? is to be dropped for 
consistency. 

On the other hand if 6,-+0 (.e., if a discrete mode is near grazing), then 
using F(x) w /2x/2, we obtain 


: 2 6 bhi) ce 2 ao viata 
(53) Hr | Dé, * Gy oh magi = Cee D cos 65 a 


where G, is finite in the limit. Thus identifying », with uw, and w,+41 of (49) . 


and (50), etc., we see that the correct limiting forms for near grazing modes 
follow directly from the asymptotic representation (51). 
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of some Harmonic Gradient Fields 
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/ 


Introduction 


| 

Probably the simplest problem in classical mechanics is that of the motion 
| of a material point or particle which moves under the influence of a constant 
» force field in Euclidean space. Written in vector form, the equations for such 
! a motion in u-dimensional Euclidean space are 

| 

f (4) Day. 

| where «= (x1, x?,..., x”), wm=(al, «2, ..., a”), and « represents a constant vector}. 
i The force vector « of the system (1) represents a contravariant vector field 
| in n-dimensional Euclidean space and as such can be considered as an infinitesimal 
_ transformation on the space. In particular, a constant vector is an infinitesimal 
translation. The motions considered here are those under the action of force 
| fields that are generalizations of the infinitesimal translations. 

| The generalizations are made in two ways. One way depends on some group- 
_ theoretic properties [3] [4] and provides a generalization in non-Euclidean spaces. 
| The other way in which the generalization is made depends on potential-theoretic 
considerations. The motions discussed here are those under the action of force 
fields which are generalizations of the infinitesimal translations made in this 
latter way. 


An infinitesimal translation in Euclidean space is the gradient of the function 


u=—=)>«' x‘, which is harmonic, vanishes on a hyperplane and is positive on one 
i=1 
side of the hyperplane. Now let D be a convex domain in the Euclidean plane. 
One would like to find a function V harmonic in D, which is positive in D and 
vanishes on dD (the boundary of D). Clearly one cannot expect to have V 
harmonic everywhere in D unless, of course, the requirement that V vanish 
everywhere on dD is dropped. There are two possibilities for V that immediately 
suggest themselves. If V=G(P, Q), where G is the Green’s function of D, then 
V satisfies the requirements in that it is harmonic, vanishes on @D and is positive 
in D. Also, if w=f(z)=w-+7v is the analytic function which maps D onto the 


1 Dots will be used to represent differentiation with respect to time throughout 
the remainder of this paper. Also it is assumed, without loss of generality, that the 
particle has unit mass. 
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upper-half plane, v>0, with éD going onto the real axis, then the function v 


has the property that it is harmonic in D, vanishes on oD, with the exception 
of the point which goes into infinity, and is positive in D. Thus one obtains 
a generalization of the infinitesimal translation by considering the gradient fields ° 


of v or V as infinitesimal transformations. In particular the motions to be 
considered are those governed by the equations 


L=—V 


x? 


(2) oo AN 


In 1954 Firey [2] introduced the concept of ballistically closed sets associated 
with a system of equations 
(3) Cee 
where «x (4) is a vector function in a real Euclidean vector space and A is a bounded 


linear operator. In particular, a closed set F with interior point is said to be 
ballistically closed with respect to (3) if (i) for two points A, #, CF and two 


distinct times ¢, and ¢, there exists a unique solution to (3) such that (4) =A, . 


x (to) = P,; (ii) the path segment between F, and F, lies entirely in F. 
The concept of ballistic closure can easily be introduced for a system of 


equations of the form %=f(x, x). Indeed, if F is defined by Mois 0, then 
ra 


the system (1) is ballistically closed. It is easily shown that in general (2) is © 


not ballistically closed with respect to D. This is done by considering a problem — 


where criteria established by FrrEY can be applied. 


Equations of Motion and Existence of Solutions 


Let D be a convex domain in the Euclidean plane whose boundary contains 
the point at infinity. It is easily shown that for such a domain there exists at 


least one direction such that every line in that direction intersects D in at least — 


a half-line or not at all. Thus without loss of generality it is assumed that the 
intersection of every vertical line with D is either a half-line (possibly the entire 
line) or is empty. Further it is assumed that the half-line extends to infinity 
in the direction of the positive y-axis. Let /(z)=u(x, y)--iv(x, y) be an analytic 
function which maps D conformally onto the upper half-plane v>0 with the 
point at infinity going into infinity and the remainder of éD going into v=0. 
Let /=—gradv; then F is a generalized infinitesimal translation and the 
equations of motion are given by (2). By a solution of (2) are meant two functions 
x(t) and y(t), denoted by z(#), which satisfy (2) and are in D. Also assume that 
t>0. 


Equations (2) yield the energy integral 


(4) aban gions 92 
Since v(x, y)>0 in D, then 0<v<c. Thus one concludes from (4) that 
(5) kSye, ysye 


for z(¢) in D. 


) 
; 
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Since the function v(x, y) is harmonic in D, v, and vy satisfy a Lipschitz 


) condition in every compact subdomain of D. Consequently there exists a unique 


solution for some interval of time for any given initial conditions. 


Proposition 1. The solution z(t) either goes to OD or the solution exists for 


all time. 


Proof. Suppose that z(¢) does not go to dD in finite time. Let ¢, be the 
supremum of all 20 for which the solution exists and suppose that ¢, is finite. 
It is evident from (5) that z(¢) will approach a finite limiting position as t+f). 
From the continuity of v, and v, in D and equations (2), x(f) and y(é) will 
approach a finite limit as t—>¢,. Thus x(¢) and y (é) will approach limiting values 
at 7). Consequently, the solution z(¢) will exist for some ¢>¢,. This contradicts 
the assumption that ¢) was the supremum of all such ¢. Hence ¢) must be infinite. 

Before attempting to discuss the motions governed by equations (2), it is 
necessary to establish some preliminary results concerning the function v(x, y) 
and its behavior in D. 


Behavior of v(x, y) in D 


If again f(z) =u(x, y)+7v(x, y) is an analytic univalent function that maps D 
conformally onto the upper half-plane v>0 and takes the point at infinity into 


_ the point at infinity, then the Green’s function of D is 


) Or 


such that for each z€D 


calf 
4 4U Up 
G (z, %) = — 5 log 1 wae 


Thus it is seen that there is a continuous real-valued function 
(u—Up)® + (U+U 9)? 


Vo 


0 (2, 2) = 


lim, 0(2, 2) G (2,2) =20(%,.y); 


Zy—> 0 
ese (9) = [Mato esbodt | 48 
Bina. ve + uz Uo 
Uptvo . 
set 0 (2, 2) =9 (2, 2) F(Z) Where t(%) =—°——;, then 


Vo 


lime 7 (4G (2, 25) = Oa Vi 


Zy—> 00 
and the convergence is uniform in any compact subdomain of D. Thus for each 
compact subdomain of D we can find a sequence {2 ,} such that 2,—>oo and 


limes (25,) G (2, 25.) 20%, 9). 


Since t(Z,)>0, then for each » the level curves of 1 (%,) G(z, %,) are strictly 
convex [5], so that in the limit the level curves of the function 2u(x, y) are 
convex though not necessarily strictly convex. Since this is true for every 
compact subdomain of D, the level curves of v(x, y) must be convex throughout D. 

There are two domains which are excluded from the present discussion. If 
D isa half plane, then the equations of motion are given by (1), and the resulting 
motions are well known. The vertical infinite strip is also excluded and treated 
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separately later. With these two domains excluded, it can be assumed, because 
of the convexity of D, that D is contained in an angular domain bounded below | 
by two intersecting lines, one with positive slope and one with negative slope. | 

The domains D have been taken in such a way that the intersection of each ° 
vertical line with D is either a half-line or is empty. Let L be the collection of 
all vertical lines intersecting D in a half-line. If 4€L, then the sequence 2, 
can be taken to lie on J,. Since the level curves of 1(2,) G(z, %) are strictly 
convex, they will intersect J, in precisely two points with the point z, between 
them. Thus as 2, > oe along J,, the level curves of v(x, y) must either intersect J, 
in one point or coincide with /,. Since this is true for all JEL, the level curves 
of v(x, y) must intersect every J in one point. Thus since the curve v(x, y) =k>0 
is convex, its slope with respect to the x-axis must be finite everywhere in D. 
The slope is given by 


(6) Loe 


Since v, and v, never vanish simultaneously in D, equation (6) shows that v, 
can never vanish in D. Since v>0 in D and v=0 on @D, then v,>0 in D. 
Let F(w) be the inverse of f(z); then F(w) is analytic and univalent, and 
F'(w) +0 for v>0. Thus the function 
_ Fw) 
is regular analytic, and Im{g(w)} is harmonic for v>0. The condition that the 
curve v=k be convex becomes Im {g(w)}=0. One has the strict convexity with 
strict inequality. If the maximum principle is applied, then either Im {g(w)}>0 
or is identically zero. In the case when 
E(w) | _ 
i { or aay 
then 
Fw) 
7) Fw" 
where a is a real constant. If a=0, then F’’(w)=0, or F(w)=aw+f. This 
implies that D is a half-plane, which has been excluded. If a-+0, then (7) yields 


F(w) = 2 Yt 


which, contrary to hypothesis, is not univalent for v>0. Thus Im {g (w)}>0, 
and the level curves v(x, y) =k are convex. A simple calculation gives 


(8) Vy Vag + QWVy Vy Vey + Vy Vyy > 0 
in. 

Since D is contained in a domain bounded below by two intersecting lines, 
one with positive slope and one with negative slope, the level curve v(x) y) 8 
must have positive slope somewhere and negative slope somewhere in D. Thus 
along any level curve, equation (6) implies that v, will be both positive and 
negative. From the strict convexity of the curves, there will be a unique point 
on each level curve such that v,(«, y)=0. From (8) it is clear that when v,=0, 
Vyy > 01n D, Since v(x, y) is harmonic, it follows that Vy,<0in D. Thus v, (x, y)=0 
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| defines a curve with we ' 


—s AY. 
dy Vex 
j Since v,, never vanishes in D, v,(x*, y)=0 defines x as a single-valued analytic 
' function of y. 
| It is still necessary to determine the behavior of the curve v,(x, y)=0 
; near 0D. By the convexity of D and the way in which we have situated the 
) domain D in the xy-plane, there will be a unique horizontal line of support; 
| call it A, [7]. Again by the convexity of D, 4, will intersect @D in one point 
; or in a single line segment. If A, is any horizontal line above 4,, then A, will 
' intersect 0D in two points, and there will be points of D lying between A, and Ay. 
If A, intersects 0D in one point, then let P be any point of D between 4, 
_and 4,. There is a level curve of v(x, y) passing through P and intersecting A, 
In two distinct points. This level curve is strictly convex and consequently has 

a unique minimum between /, and A,. Thus v, (x, y) =0 somewhere in the region 
_ bounded by A, and @D. Since this is true for all such horizontal lines A,, and 
since y decreases monotonically along the curve v,(x, y)=0, this curve must 
_ approach the point of intersection of 4, and @D. 
| If A, mtersects 0D in a line segment, the above procedure fails since the 
domain between A, and aD approaches a line segment instead of a point as A, 
approaches /4,. This procedure does show that the curve v,(x, y)=0 must 
approach A,. However, it can be shown that v,(x, y)=0 must again approach 
| a definite point. 
| Since y decreases monotonically along the curve v, (x, y)=0, the curve will 
_ either approach a definite point or oscillate and come arbitrarily close to every 
- point of some subinterval of the intersection of A, and @D. Suppose v, (x, vy) =0 


oscillates and comes arbitrarily close to such a subinterval which is denoted by jy : 
The function v,(%, y) is harmonic in D and vanishes on the horizontal line 
segment 4,, and can therefore be extended to be harmonic below /,. Call this 
extension w(x, y). Let P be an interior point of the segment De Then there 
exists an analytic function g(z) such that w=Re {g(z)} and such that g(P) =0. 
It is known that there will be k curves passing through P for which w(x, y) =0, 
k being the order of the zero of g(z) at P, and such that the angle between these 
curves will be z/k. It is also known that in a neighborhood of P that w(x, y)=0 
only along these curves. If k=1, then the line segment hy is the only curve in 
the neighborhood of P for which w(x, y)=0. This contradicts the fact that the 
curve v,(x, y)=0 comes arbitrarily close to P. Thus k= 2, and g’(P)=0. Since 
this must hold for every P in is then g’(z) =0 and g(z) must be constant. This 
contradicts the fact that w(x, y) isnot constant. Consequently, v, (x, vy) =0 cannot 
approach a line segment and must approach a definite point on the intersection 
of A, and 0D. Therefore the curve v,(x, y)=0 divides D into two regions D, 
and D,. In D, v,>0, and v,<0 in Dg. 


Behavior of the Trajectories 
It is clear from equations (2) that j¥<0 in D since v,>0 in D. Thus 4 (i) 
is a strictly decreasing function of ¢. If y(¢) ever becomes negative, then y (t)< 
—n<0 (y a fixed constant) for all subsequent time and y(#)S—yni+6 (6 a 
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constant). Since the right-hand side of this inequality tends to — co as to, 
and since D is bounded from below in the y-direction, it is easy to see that the | 
solution exists only for a finite interval of time [0, ¢)]. From the earlier discussion | 
one can further conclude that z(¢) tends to a definite boundary point as t+%. » 
If 9 (t) vanishes for some value of ¢, then since j (f) <0, y (¢) must become negative | 
in D, and z(t) will behave as above. 

Now suppose that y(¢) is positive throughout the motion of the particle. 
Assume that the trajectory never approaches the boundary of D. Let z(0)=2 
be the initial position of z(f). Then construct a region R bounded by the curve 
v(x, y)=c, 8D and the two vertical lines x= + x, (%;>0) such that the portion 
of the curve v,(x, y)=0 between @D and v(x, y)=c lies completely in R and Z 
also lies in R. Since it has been assumed that the boundary is not approached, 
then there exists a sufficiently narrow strip along ¢D in R which will never be 


entered. Delete this strip from R and one obtains a region R. In Rk, v»>d>0 | 
(6 a fixed constant), and therefore ¥j(¢)< —6 for z(f) in R. Thus y(¢)<—dt+y 
(y a constant). This inequality together with the assumption that y (¢) is always 
positive is only possible if z(t) leaves R. However, z(¢) can leave R only by ° 
crossing *= %, or x= — x,. Suppose z(#) crosses x=4%,. When z(t) crosses x= 4%, 
&(t)=0 and X%()>0. Therefore x(¢) must become positive, and z(é) will never 


re-enter R since to the right of x=x,, ¥(#)>0, and x(t) will be strictly increasing 
for all subsequent time. 

Now consider the trajectory after z(¢) has left R. The slope of the trajectory — 
is given by 


Since 4 (¢) is positive and decreasing while *(t) is positive and increasing, then 
dy/d x is positive and decreasing. Therefore the trajectory is concave with respect 
to the x-axis. Since *(¢) remains above a fixed positive constant, and since 
the trajectory is concave while the boundary is convex, z(¢) reaches the boundary 
in finite time. This contradicts the assumption that z(¢) did not approach the 
boundary. 

Since z(!) must approach @D, it must approach either @D, or @D,. Assume 
that z(t) approaches 6D,; then «(t) must become positive. Thus a discussion 


similar to the above shows that z(f) reaches the boundary in finite time. Sum- 
marizing, one has 


Proposition 2. Let D be a convex domain which is not an infinite strip, and 
let the point at infinity be a boundary point of D. If v (x, y) is a positive harmonic 
function in D with v(x, y)=0 on @D except at infinity, where it is unbounded, 
then each solution of equations (2) will converge to a definite boundary point in 
finite time. 


It should be noted that there is essentially only one such function v(x, y) 
for any given domain D. 
The Infinite Strip 


The special case of the infinite strip must still be treated. There is no loss 
of generality if D is taken to be —ha<x< 3a. Thus v(x, y)=e’cosx and 
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» equations (2) can be written 


(9) lene ECS NO A 
(10) V = — e’ cos x. 


| The curve v,(x, y)=0 is given by x=0. Thus D, becomes —3n2<x%<0 and 


D,, 0<%<$a. One solution to (9) is given by x(¢)=0. In this case (10) gives 


jae, 


_ which has as its solution 


e” =csech? ye 4— | C0. 
\2 


Thus y tends to minus infinity as too. 


Now let z(¢) be a general solution of equations (9) and (10). If «(¢) ever 
becomes negative in D,, then since ¥(f)<0 in D, the solution remains in D,. 
Also since %(t)< — 6 (6 a positive constant) z(¢) will meet the boundary in finite 
time. Similarly, the solution z(é) will meet éD if «(¢) becomes positive in D,. 

Assume now that %(¢) is positive if z(¢) is in D, and is negative if z(f) is in 
D,. Under this assumption a solution can never reach the line x=0 in finite 
time. If this should happen at ¢=/,, we should have x(t,)=0 and %«(t,)=0. 
But equation (9) considered as an equation of x(¢) alone is of second order and 
is satisfied by x(t)=0. Since x(f,)=0 and x(f,)=0, the uniqueness theorem 
for differential equations guarantees that x (¢) =0. The solution therefore remains 
completely in D, or D,. In this case, considerations similar to previous ones 
lead to the result that z(¢) is defined for all ¢>0, and x(¢)->0 while y (¢) +— co 
as too. 
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On the Solution of Nu-—q(x)u=f (x,y) 


over a Rectangular Domain 


TsuAN Wu TING 


Communicated by C. MULLER 


1. Introduction * 

Let an open rectangle G with boundary I” be defined in 0<x<a, 0<y<8. 
The boundary-value problem under consideration is to find a solution w(x, y) 
such that w—0 on J’ is twice continuously differentiable in G, satisfies the 
equation 


(1) Au — q(x) u=f(x, y) 
everywhere in G, and is such that the improper integral [7] 
SJ (uz + uy +) dxdy 


exists. The functions f(x, y), g(x) and their first partial derivatives are assumed 
to be Hélder-continuous in G+J’, and g(x)=0 in G+I’. A particular case of 
this problem is to find the solution of REyNoLpDs’ equation [2] for partial journal 
bearings, which can be reduced to (1) by a simple transformation of the function 
under consideration. 

It is known [1] that a unique solution to this boundary-value problem exists 
and is the unique solution of the following variational problem: To find a solution 
function u(x, y), vanishing on J’, which minimizes the variational expression 


(2) Jt] = SS fi + vy + 9(9) w+ 2i (x, 9) phaxdy. 


The admissible class of functions y(x, y) for comparison consists in those functions 
that are continuous in G, that have piecewise continuous derivatives in G, and 
that vanish on /’. Furthermore, the improper integral 


it (p+ yety)dxdy 


exists. Conversely, the unique solution to the variational problem is the solution 
of our boundary-value problem. 

Based on these known results, together with those of Hopr [3], the purpose 
of this note is to expand the solution u(x, y) into a double Fourier series, to 


* This paper was written while the author was at the General Motors Research 
Laboratory, Warren, Michigan. An abstract of part of the results was published 
in the Notices, A.M.S., pp. 914—915, December, 1960. 
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determine the order of magnitude of the Fourier coefficients, and to exhibit a 
convergent minimizing sequence of the variational problem. Thus, a method is 
provided for the approximate determination of the solution, and a computable 
bound for the approximate solutions is exhibited. 

2. Reduction to the problem of solution of an infinite system of linear 
algebraic equations 
a Da Mt, Oa V0. 


| 


Then (1) and (2) can be written respectively as 


6 (2p + (22% aoe —r0.9) 


(4) Jtyl = hs (2 oo) + (E95) +av+2iy} dodo, 


where, for simplicity, we have used the same letter G to denote the open square, 
—0<0<a, 0<g<aq, and where its boundary will also be denoted by I’. 
Since it is already known that a unique solution u(#, y) exists and the partial 


derivatives of w up to the third order will be Hélder-continuous in G, we may 
restrict the admissible class of functions. Accordingly, as admissible functions 
| we consider w (#, gy) that vanish on [and that have the Fourier series expansions [4] 
( lo) 
wy (3, ~) = Di %m,, Sin md sin n p 
. m,n=1 
in G+TI and 
oy el 9 
ao = dm cosmo sinn Gy, 
m,n=1 
B [ee) 
| a DY) 2 Xm,n Sin MO cos ny, 
m,n=1 


in G, where 


linn = az [ ysinmésinn pdddg. 
G 


From the Parseval relations [4], (4) can be written as 
a af m? se oo eae 
oS (Gt Ha 

+ i: (2 


ES hay ae sil fsnmésinn pdbdg. 


(sin md sin n ) dodp— 277 >) By Kinin? 


Mm, n= ee m,n=1 


_ where 


The vanishing of the variation of J with respect to each parameter ¥,, ,, yields [9] 


(5) (+ Xn ae (qsinmdsinn g) udidy =2* B,, 
Fi 
G 
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for m,n—1,2,.... Again, from the Parseval relations for the functions g sin m@} 
sin np and u, (5) can be written as 


2 


2 2 & Z 

(6) (+ Fa n,n > Aine tn = Oma (m,n =1, 27S i 
¥=1 

where 


A ,= 2 [ qsinmdsinr 9 dd. 
1 
0 


This shows that the Fourier coefficients of the solution u (0, Y) of our boundary- - 
value problem are determined by the linear algebraic equations in (6). 


3. Order of magnitude of the Fourier coefficients x, » 
The main result of this section is to show that the unique bounded solution, | 
{%m ,}, Of (6) is such that 


(7) *m,n ecru ay 


We shall see that this estimate is the best possible if q(#) or /(3, ~) satisfies only 
the conditions stated at the beginning of this paper. Furthermore, this result » 
will enable us to obtain a computable bound for the maximum error and to) 
estimate the maximum rate of convergence of the approximate solutions as | 
defined in this paper. 

To prove the assertion (7), we shall make use of the theorems of Kocu and _ 
HILBERT [6] and put 

_ ao 

(8) Xm n 7 (m?/a? “i n® |b?) Xm, n? Aye: n= ae CoE . 


Under these transformations (6) becomes 


is) — 
(9) Line Dame: eee a (m,n =1,2,...). 
aS 


It is true that the existence and uniqueness of the bounded solution of the 
system (6) is a necessary consequence of the existence and uniqueness proof for 
the solution of our variational problem. It is also clear that a bounded solution, 
{m,n}, of the system (9) will yield a bounded solution, {m,n}. of the system (6). 
However, a bounded solution, {x,, ,}, of the system (6) need not be a bounded 
solution of the system (9) under the transformation (8). Accordingly, the existence 
and uniqueness of the bounded solution to (9) has to be proved. The purpose 
of this proof is to obtain the order of magnitude of the Fourier coefficients Lone 
as given by (7). Further, we observe that the index n may be considered as 


fixed, and the system (9) yields equations for determining the unknowns, {%»,»}, 
with respect to the index m. 


We now show that, for any value of , the system (9) satisfies Koch’s as 
well as Hilbert’s conditions. The justification will be completed if the three series | 
Oi ees co 


pa |Ain, m,n| > ph (A, Y, niet >; (oy Pa 


m=1 m,r=1 m=1 
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are proved to be convergent, and the homogeneous system corresponding to (9) 


has only a trivial solution. But 


m=1 
li faas) > 


0 


Sa ies |A mn, m| 
al m, all a 70 ( (m?/a2+-n2/b?) 


2 


ae (m2/a2-++-n?'b2), 


Pen oes 2 


5 (7?2/a?-+-n2/b2) | 


2 fee ale cosy i cos m 0) add 
0 


= 9% 


Vera 
— 14 
Ss const. > a 
m? 72 
m,7=1 
and BS 
m=1 m, n= 


mm v/a? 


Thus, these three series are indeed convergent . all values of 1. 


Let {@,,,} be the solution of the homogeneous systems corresponding to (9), 7.e. 


i) — 
) (4 0) Ory, n = Ds ay Yn O, n 
(ail 


=0 


) 


In order that the left sides of these equations have meaning, the sequences 
{@,,,,} have, of course, to be bounded. In fact, we have stronger conditions 


to be satisfied by {Om,n}, namely, 


=~ 1 
ee (WA Dawe) 
because he S&S 3, » 
| (a5) <= A Am, Y Yn 
m,n 2. m,r,n QW, wl Fert aie 
Cc 
1 const. es by \\t=2 
= = 2 
< const. ae art wyBE = jtan ae ale = 
Define = 
m,n ae 
Onn = m?/a2+ n2/b2 (m, — Ai 2, ) 
Then (10) becomes 
m= , ne 
(11) (7 +52) mnt J Am o, 72 =0  (mn=1,2,...). 
Consider the sequences, {7 @»,,}, {%@m.n}, Of real numbers. Since 
M Om m const. 
= =< ci 
Mm Om, n m?/a?+-n?/b2 sare n(m?/a2+-n2/b2) — mn ? 
N Oy, n const. 
= za 
"Om, n m?/a? + n?/b? aye et n(mja2+ n2/b?) — mn ’ 
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24 
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it follows that 


fore) 
(m On a << [o,2) , bo (n Dy, a cele 
: m,n=1 


*Ms 


m, 


By the Riesz-Fisher theorem, [7], [4], there are square-integrable functions, 
09(9, 9), 0, (9, y), defined in G+’, such that 


wd odd dy — os (1.0, n)?s alt edidp= > $1 (Om 9) 
Yes m,n=1 
and 
4 : 
Mm n =e f o,cosmésinn pdbdg, 
G 
eet [ sinm? cosn pdtd 
MOm n = 7 Op ? si 
fe 
Define : 
9,9) = S 0549; 
then [4] | 
oe) g 
0(9, 9) = X Om, snmdsinn yp = f 0,49, 
m,n=1 —72n 
4 : F 
Om,n = =e [ esinmdsinn pdb dg. 
G 
Hence the notations 
ee RL, 


are justified. 


By multiplying (11) by o,,,, and summing over the indices m,n, we obtain — 


(12) (n8]02 + W888) 8,6 + Dn ( 3 Ayer ult) 0. 
mM, n= m,n=1 — 
Define 
Avge a = [[ (qsinm?@sinn g) sinrdsins pdbdg. 
G 
Then 
Apaah = Aging: 
and 
ss 2, Amr Orn = = en Coe a if (gsinm#dsinn ~) odddq@. 
fi 7,S=1 
G 


This is legitimate because the single and the double series are both absolutely 
convergent, and they are equal to the integral on the right in view of the Parseval 
relations for the functions @ and g sin m# sin n gy. Now (12) becomes 


=r = Jf leslae+ (2, i} d0do + Y Omak J, qgesnm#ésinn pdddy=0. 
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Applying Parseval relations for the functions @ and ge to this equation, we have 


S[[ (Geos Gest ree aoay—o 
G 


Since g= 0, this is possible if and only if 


Ils Te eOes =i ae )d0dp = ef peatst so 


! Thus 


Din 0 (Wyh= FeO. .3)ls 
: and 
Dyn = (In?|a? + n2/6%) Wy, = 0 


_ for all finite values of m and n. These results together with the equations in (10) 


imply 
lim ,,, = lim @,,,= lim @,,,—=0. 


m—> CO nN—> CO u m,N—> OO 


Hence, we have proved that 


On,n = 9 sea 


- and the homogeneous systems corresponding to (9) have only trivial solutions 
_ for any value of n. 
Now, Hilbert’s theorem asserts that for any fixed value of 1, the system (9) 
CO 
Misuchithat® 44) <0. 
m=1 


_has only one solution, {%,,, , 


To prove the assertion given by (7), we consider the inequality 


(13) \iner| = 1B an 


m,r,n xX, n 


By the Lagrange-Cauchy inequality [6], we have 


But, for all values 37 nN 


pa EA = BD) Tr, = o(4), gs (%, ,) 2 < 00; 
thus 
(14) DArmnakrn =0()  (#=1,2,...) 


Further it can be verified by partial integration that 
io 

(15) Banos =0( 5) . 

Now from (13), (44) and (15), we have 


»=0(= (oe ale 


24* 
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Using this result, we can write 


Dr f (@ siny dcosm #+grcosy 3 cos mB) dd 
0 se 


fe. @) ae = ore i 
Di Am, rn %,n| S OF men = 72 m ("a2 + n2/b2) 
v=1 vy=1 
jac qo sin r 8 cosm 3-+g cos # cos m8) dd 
oe Y 
owl A Lh : 
(16) < const 2, MEME 
= 
const. f 4 by [rar 20 1 
Sinn lee aa (sx): 


Now (7) is a direct consequence of (13), (15), (16) and (18). As can be seen from 
the equations in (9), the estimate (7) is the best possible if /(#, p) or g(#) satisfies 
only the conditions imposed on them. 


Because of the absolute convergence of the corresponding majorant series, 


— const. ~ const. ~ const. 
os mn(m=/a2--n2/b?) ’ 2a m (m?/a? +-n?/b?) ’ 12 n (m?/a?-+-n?/b2) ” 
the series 
foe) oe} co 
Dd Xm nSinmIsinng, LY MXp,,coomosinngy, 0X, ,sinmd cos np 
m,n=1 m,n=1 m,n=1 


converge absolutely and uniformly in G. This also shows some resemblance 
between our solution and the solution of the equation Au=f(x, y), [11]. 


4. Method for finding an approximate solution-minimizing sequence 


Even though the convergent behavior of the left sides in (9) is known, in 
general such an infinite system can not be solved exactly. For approximate 
solutions, we consider the sequence, {yyy}, defined by 


N 
(17) Wy = D>, «4, sin m# sin ny, 
m,n=1 


where the coefficients, «{"),, are determined by the finite system of linear algebraic 


equations 


N 
(18) a (m?/a? si n®/b?) a se 2 ye r sae = Bae n? 
r=1 
m,n=1,2,...,N. By using the same technique as before, it can be shown 


that the finite system (18) always has a unique solution for any value of the 


integer N. Moreover, using Kocu’s results [6] on normal determinants, we can 
prove that 


N 
oy ee as N->oo. 


In fact, it is easy to see that {y~y)} is a minimizing sequence for the Dirichlet 
integral (4). Accordingly [7], yy) is an approximate solution in the sense that 


lx) — #l| approaches zero as N->oo. But we shall obtain stronger results than 
these in the next section. 
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The method of finding an approximate solution by solving the truncated 
finite system (18) is known [8] to be convenient. The essential feature of the 
approximate solution yy) is that it has the closed form (17). Also we note that 
the same method for finding approximate solutions can be applied to the non- 
homogeneous boundary conditions, provided the boundary-value function, g, 
defined on J’, can be extended to a function defined on G+J’ in such a way 
that its partial derivatives up to the third order are Hélder-continuous in G+TJ. 


5. Maximum-error estimation for the approximate solutions 

KANTOROVICH proved a general theorem [9], [10] about the rate of con- 
vergence of the approximate solutions found by minimizing the Dirichlet integral. 
However, for this problem a stronger result can be obtained. Inspired by Kry- 
-Lov’s work [9] on ordinary differential equations, we shall present a comput- 
able bound for the maximum error, max |yy)—u|. To show that this bound 
will also indicate the maximum rate of the convergence of the approximate 
solutions, yy), defined by (17), we shall compare y,y) with the segment of the 
double sine series of the exact solution w. 

The systems of equations in (6) and (18) can be written as 


[o.@) 


1 
(19) Xm, n | 72 (m?/a2+-n?/b?) oe Aye xy a Pe niy Toe (m?/a? ss n?/b?) , 
Va 
NO a, Alla 
i CO 
(20) aN) 4 : » Ans Xp n= Bm, lz? (mPa? + n?/b?) 


mi? (m?/a®—+-n?/b?) 24 


| Bie 12,...., IV), where a) = 0 for r2N-+1 or n2N+1. Denote by %y) 
-and R,y) the errors of the approximate solutions and the errors of the segment 
of the double-sine series of w, 1.e., 


ro) 
N . - 
YN) = We (ao oS, Kinga) sin m1 # sin 2 Y, 
| m,n=1 
| oo 
Rwy = — Dd %m,nSinms sin n —. 
mn=N+1 


It is not difficult to see that we can construct the following two identities from 
the systems of equations in (19) and (20), namely, 


N ; 

4sinm #sin n p a . ‘ 

| ; 7nysinmésinny dé dr 

1N) Dea eBT J. gn) § / g / 
G 

SI 4sinm?dsin n p 


"4 te (m?/a2+n?/b?) | 


(21) = gusinmésinny dé dyn 


Gi 
os S By,Sinm O sin n @ , 


— (m2/a? +-n?/b2) wre 


m,n=N 


) m,n=N-+1 


co . . 
4sinm } sin 2 p [ : . dé dt 
= usin m€ sin 17 7 
Rw) eS on®jattnaey J} 7 g 7 dé dy 
mn=N+1 G 
2) <1 Bu nsinmésin n 


a? (m?/a?+ n2/b?) 


m,n=N+1 
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From the two identities (21) and (22), we have [4], [7] 


ules Ua Y 7 
roy] S Rav 4 > 2 th (m?/a2+-n2/b?) jp erosin Seana | 
= Tara in mf silt 4 
Ss |Rowy| a I pee (m?/a2-+_n2/b?) sl U 
m,n= ia 
4sinm @sin n yp Fsin'h “ded 4 
= |Rwv| ig um Din tm 2a? + n2/b2) sin m n q 


N 5 1 z 
; 4(sin m sin n ¢)? |? 1 2 
= [Ron : 2 poesia: | eres | [faroas in| 
G 


r 
[[aroasan ; 
G 


Thus, to estimate |7,y)| we need only to give estimates for the first and second 


m,n=1 


b 
<|Rwwy| + <7; max q? 


terms on the right-hand side of the last inequality in (23). 


First, we give an estimate for ff g7{y)d0dq. Multiplying both sides of the 
G 


identity (21) by qv) and integrating them over the domain G, we obtain 


N 4({f ax) sin m sin ny dbdg)" 
G 


[J 97s ddp+ > 


m,n=1 


mt (m?/a? + n?/b?) 


4 oO 
pki ea ee mesin ny dean) indy] at 


m,n= ga 


By, Sin m 6 sin n Q 


- n=N+1 


Since the first and the second terms on the left side of this inequality are both | 


positive, we have 


Fic 


— 64 8 
<|{ aren > ee [[avsinmesinnnatdn)aodg| + 


m,n= Zs 


B sinm ?sin 2 
+| Lferm( > 7a ? 


es oe a4 (m?/a? + n®/b?) 


Dmateaay ) ‘9 7 


‘aoa. 


| 
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Now we give estimates for the first and the second terms on the right-hand 
side of the inequality (24). The first term is less than or equal to 


[ener 
G 


4sin m ? si 
«Hil > nt IGE [lpsesermt sina a0dg) 


mn=N+1 


(25) <max¢ [fart vag] | 
G 
Se ae | 


4 fore) 
raat arin adda) | en (-£ x3 n (m?/a? +n?/b?) 


m,n=N+1 
< 4 dodgy] 2 lallinel 
Cas Sderot 


where we have fab use of the Riesz-Fisher theorem and where ||| denotes 
the L,-norm of g. To estimate (4). If u 
minimizes this integral, then [9] 


a) us 4 (zy uz + gu? fu} dd dp =0. 


x 


se 2ffqusinm Esinnn dé dn 274 
G 
( m3 (m?/a? + n?/b?) | 


mn=N-+1 


1 
2 


Consequently, 
(FF tes| [fmaoan| = lel 
fe 
‘But D As Be i 
Soult — 3° (unl S SCs)? = 4 
Hence 


lve S (2) Ml 


From this inequality and the last inequality in (25), we know that the first 
term on the right-hand side of the inequality (24) is less than or equal to 


aA 2 * ad aiitl_ 
(26) max q [[fartnaoee] yaar 


The second term on the right-hand side of the inequality (24) is less than 
or equal to 


Byjnsinm $ sin n p 
[J anwstabag) (3 m,n=N-+1 7 (mn ]a?+n2/b*) : } ‘ody 
t — (Bin, n)?/4 5 
< max ¢ i qin) andy) Ds 7 ae sie 
. m,n=N-+1 


Since 


f 2 
(Bun)? =| al fsinmdsinng dd iy) 
G 


IA 


(2) 140. ol + 14 ol + toll, 
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the second term on the right-hand side of (24) is less than or equal to 


: of SG WO, OI+Ik(z wl+lfol)? 
If ary adbag S| pa m? (m?/a®-+n?/b2)? 
G 


max g! 


m,n=N+1 
(27) | | 
< 20/0, 9+ D+) < max gi 
V3.8 (N-+1)? 


ik ary ad dg) 
G 
Finally, from the inequalities (24), (26) and (27), we see that 


7 ees J a B° lqi||i|| _ ad(IF(0, p+ (a, 9)||+llfol 
28) [ff etn eoae) sae (ee Lele oH 


and the estimate for the second term on the right-hand side of the last inequality 
in (23) is completed. 

From the inequalities in (23) and (28), we see that to estimate |ry)| we need 
only to give an estimate for |R,y)|. From (22), we have 


[Ruy| S 


XE 4sin m # sin n p 
2/a2-+-n?/b?) 


rT qusin mg sin wy dédy] 


pag ee a at (m 


co 


7 
3 BynSinm sin n | 
= 


ir 72 (m?/a®-+n2/b2) 


m,n=N+1 


Now, the first term on the right-hand side of (29) is equal to 


4sin m # sin n p 


Sn (ma? n)62) [fa sinmécosnyn dé dy 
G 


mn=N+1 ue 
4sin m sin : 2 3 q 
= Fiatey oe ante nT SH ee Coote n| dé in| | [[ (qu,)? dé in| 
(30) Co ea v 
—_ 4(sin m # sin n 
s[ y Sones a Tell 
< 4b half 


~~ wV3(N+4)2 ’ 
and the second term on the right-hand side of (29) is equal to 


4sin m # sin n 2 


z4 (m2/a2-+ 0/62) [J {sin Femina dedn 


m,n=N+1 
aS — 4FG S14 life (0 PII+llf (x, I) 
= 2, mm n (m*/a? + n2/b2) . ae x mn (om |a2-+ nb?) ag 
n ~ 4 (sin m # sin n gy)? 
(3 ) ai eas 8 m2 (m?/a®+ n2/b2)2 ‘| I7ol 
< 2ad|f(3)| | 242 (lio MItlfo(x, oll), a bllfol 
mt (N +1)? m4 (N-+1)2 ig 


m3 (N-+1)? 


< we {FG + zl 
=o y2 Ip (0, @)!| + 
iN ne {ltl 9) | + |f, (7, ») || + ae 


Solution of a partial differential equation 315i 


Becre [#0] =|4(0,0)| +140, 0] + fla, 0)| +] 4 (a, 20], 
ey reer iat 
lf, (0, %)|| =| G5)<4° If, (7, ») | = J (Selnnt?: 
Hence : : 
2ab fb laiiiil ar fol 
oe) (Riv) = mee ee [7 (@,9)| + Mel. N+ Meee) + S08, 


and the estimates for |R,y)| are completed. 


Finally, from the inequalities (23), (32) and (28), we see that 


2ab b? ab? ae 
a rool S saareaye (ayge tpg ce BOX 2) [eM + [7G] +10, 991+ 
+ Melee @)| + SE 4 EOE (1400, ol + le Ol + Wold} 


This is the final form of our estimation. Since g and / are known functions, the 
right-hand side of this inequality gives a computable bound for the maximum 
error, max |7y)|, and the integer N indicates the rate of convergence of the 
approximate solutions yy). If f=f() only, as in the case of REYNOLDs’ equation 
for finite partial journal bearings, then 


2ab OA eats 0F 
rool Ssaavtay ayaa t ypow B24) bal lA + 317(001 + 
- +5] / (9 + 2a 4 2hmax4 YH 
| I 213 I 13 22 o fe 


To show that the inequalities (33) or (34) also indicate the maximum rate of 


‘the convergence of yy) to the exact solution w, we compare the approximate 
solutions yy) with the segments of the double-sine series of w. From the in- 


equality (23), we have 
2 
2ab , 
= max q? [[fartmao ay] 
G 


; N 4( {arm sin m Osinn 948 dq)’ 
[favindode pees x (m?|a® +-n2/b?) = [farm Ron dbdp. 
G 


C m,n=1 


max |”y)| S max |Ruyy| + 


Also we have 


| Hence 
IJ q Vin) 40 dey < ||Rqy |? max @. 


Thus 
. 2ab 
max |7y)| S max | Rwy)| + —e- [Ray | max g- 
Consequently, 
max|yw)l < 14 2¢>lRal_ max ¢ 
max |Rcy)| ~~ x max |e) 
(35) 2ab 
| S15 max q. 


4 
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Similarly, 
Rel = Pool +] So Simatragan pf fon sinm sin nn dE dy 
m,n G 


-, m4 (m?/a?-+n®/b?) 


b 
< max |r| +22 lal Pool 


Hence 


max |R(y)| pag pee sll | | Ivcwnlh 
Mh 


max |”y)| max |r(y)| 


(36) x14 2Plal 


From the inequalities (35) and (36), we see that 7) converges to zero at the 
same rate as Ry). That is, the rate of convergence of the approximate solutions 
Way) to the exact solution « is the same as that of the double Fourier series 
for uw. Since the order of magnitude of the Fourier coefficients of w is given by 
(7), we can conclude that the estimates given by (33) and (34) are the best possible 
insofar as the rate of convergence is concerned. 
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